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Title of the invention 

HERBICIDE TOLERANT COTTON PLANTS AND METHODS FOR 
PRODUCING AND IDENTIFYING SAME 

5 

Field of the invention 

This invention pertains to transgenic cotton plants, plant material and seeds, characterized 
by harboring a specific transformation event, particularly by the presence of a gene 
10 encoding a protein that confers herbicide tolerance, at a specific location in the cotton 
genome. The cotton plants of the invention combine the herbicide tolerant phenotype 
with an agronomic performance, genetic stability and adaptability to different genetic 
backgrounds equivalent to the non-transformed cotton line in the absence of weed 
pressure. 

15 

All documents cited herein are hereby incorporated herein by reference. 

Background of the invention 

20 The phenotypic expression of a transgene in a plant is determined both by the structure of 
the gene itself and by its location in the plant genome. At the same time the presence of 
the transgene at different locations in the genome will influence the overall phenotype of 
the plant in different ways. The agronomically or industrially successful introduction of a 
commercially interesting trait in a plant by genetic manipulation can be a lengthy 

25 procedure dependent on different factors. The actual transformation and regeneration of 
genetically transformed plants are only the first in a series of selection steps, which 
include extensive genetic characterization, breeding, and evaluation in field trials. 

Cotton fiber is the single most important textile worldwide. About 80 million acres of 
30 cotton are harvested annually across the globe. Cotton is the fifth largest crop in the U.S. 
in terms of acreage production, with over 15 million acres planted in 2000. Primary weed 

l 
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species for cotton are Ipomoea sp. (morning glory), Amaranthus spp. (pigweed), Cyperus 
spp. (nutsedge), Xanthium spp. (cocklebur) and Sorghum spp. (johnsongrass). Before the 
introduction of broad-leaf herbicides that could be used on a growing cotton field, 
growers used directed, post-emergence applications of nonselective herbicides taking 
5 care not to contact the growing crop plants. As this requires a difference in height 
between the weeds and the crop, this is not always possible. Especially for small cotton, 
this practice is time-consuming and potentially damaging to the crop. 

The bar gene (Thompson et al, 1987, EMBO J 6:2519-2523; Deblock et al 1987, EMBO 
10 J. 6:2513-2518) is a gene encoding the enzyme phosphinothricin acetyl transferase 
(PAT), which, when expressed in a plant, confers resistance to the herbicidal compounds 
phosphinothricin (also called glufosinate) or bialaphos (see also for example US patents, 
5,646,024 and 5,561,236) and salts and optical isomers thereof. Phosphinothricin controls 
broadleaf weeds including morning glory and has a wide window of application. 

15 

Successful genetic transformation of cotton has been obtained by a number of methods 
including Agrobacterium infection of cotton explants (Firoozabady et al. 1987, Plant 
Molecular Biology 10:105-116; Umbeck et al. 1987, Bio/Technology 5:263-266 and in 
WO 00/71733, US 5,004,863, and US 5,159,135), as well as direct gene transfer by 

20 microprojectile bombardment of meristematic cotton tissues (Finer and Mc Mullen, 1990, 
Plant Cell Reports, 5:586-589; McCabe and Martinell, 1993, Bio/Technology 11:596- 
598, W092/15675, EP0 531 506). Increased transformation efficiency for Agrobacterium 
transformation has been reported using the methods described by Hansen et al. (1994, 
Proc. Nat. Acad. Sci. 91:7603-7607) Veluthambi et al. (1989, Journal of Bacteriology 

25 171:3696-3703) and WO 00/71733. 

Different methods for regeneration of cotton plants have also been described (WO 
89/05344, US 5,244,802, US 5,583,036, WO89/12102, W098/15622, and W097/12512). 

However, the foregoing documents fail to teach or suggest the present invention. 

30 
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Summary of the invention 

The present invention relates to a transgenic cotton plant, or seed, cells or tissues thereof, 
comprising, stably integrated into its genome, an expression cassette which comprises a 
5 herbicide tolerance gene comprising the coding sequence of the bar gene (as described in 
Example 1.1 herein), which is herbicide tolerant and, in the absence of weed pressure, has 
an agronomic performance which is substantially equivalent to the non-transgenic isoline. 
Under weed pressure and the appropriate Liberty™ treatment, the plant will have a 
superior agronomic phenotype compared to the non-transgenic plant. 

10 

In one embodiment of the invention, the cotton plant or seed, cells or tissues thereof, 
comprises the expression cassette of pGSV71 (as described in Example 1.1, Table 1 
herein). In the preferred embodiment of the invention the cotton plant or seed, cells or 
tissues thereof comprise elite event EE-GHL 

15 

In another embodiment of the invention, the transgenic cotton plant or seed, cells or 
tissues thereof comprises: 

(i) event EE-GH1 in its genome ;or 

(ii) event EE-GH1 with the proviso that the bar gene used in the event is 
20 substituted with a nucleic acid sequence that hybridizes to the complement of the bar 

gene under stringent conditions. 

More specifically, the present invention relates to a transgenic cotton plant, seed, cells or 
tissues thereof, the genomic DNA of which is characterized by the fact that, when 

25 analyzed in a PCR identification protocol as described herein, using two primers directed 
to the 5' or 3' flanking region of EE-GH1 and the foreign DNA, respectively, yields a 
fragment which is specific for EE-GH1. Preferably the primers are directed against the 5' 
flanking region within SEQ ID NO: 1 and the foreign DNA respectively; most preferably, 
the primers comprise the nucleotide sequence of SEQ ID NO: 2 and SEQ ID NO: 3 

30 respectively, and yield a DNA fragment of between 250 and 290 bp, preferably of about 
269 bp. 
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Reference seed comprising the elite event of the invention has been deposited at the 
ATCC under accession number PTA-3343. Thus, a preferred embodiment of the 
invention is the seed comprising elite event EE-GH1 deposited as ATTC accession 

5 number PTA-3343, which will grow into a cotton plant resistant to glufosinate. The seed 
of ATCC deposit number PTA-3343, which is a seed lot consisting of about 50% non- 
transgenic kernels and 50% transgenic kernels hemizygous for the transgene, comprising 
the elite event of the invention, which will grow into glufosinate tolerant plants. The seed 
can be sown and the growing plants can be treated with PPT or Liberty™ as described 

10 herein to obtain 100% glufosinate tolerant plants, comprising the elite event of the 
invention. The invention further relates to cells, tissues, progeny, and descendants from a 
plant comprising the elite event of the invention grown from the seed deposited at the 
ATCC having accession number PTA-3343. The invention further relates to plants 
obtainable by propagation of and/or breeding with a cotton plant comprising the elite 

15 event of the invention grown from the seed deposited at the ATCC having accession 
number PTA-3343. 

The invention further relates to plants, seeds, cells or tissues comprising a foreign DNA 
sequence, preferably a herbicide tolerance gene as described herein, integrated into the 
20 chromosomal DNA in a region which comprises the plant DNA sequence of SEQ ID NO: 
1 and/or SEQ ID NO: 4, more particularly which comprises the DNA sequence of SEQ 
ID NO: 5, or a sequence which hybridizes under stringent conditions to a sequence that is 
complementary to a sequence comprising the plant DNA sequence of SEQ ID NO: 1, 
SEQ ID NO: 4 and/or SEQ ID NO: 5. 

25 

The invention further provides a process for producing a transgenic cell of a cotton plant, 
which comprises inserting a recombinant DNA molecule into a region of the 
chromosomal DNA of a cotton cell, tissue or callus which comprises the plant DNA 
sequence of SEQ ID NO: 1 and/or SEQ ID NO: 4, or which comprises a sequence which 
30 hybridizes under stringent conditions to a sequence that is complementary to a sequence 
comprising the plant DNA sequence of SEQ ID NO: 1 and/or SEQ ID NO: 4. 
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The invention further relates to a method for identifying a transgenic plant, or cells or 
tissues thereof, comprising elite event EE-GH1 which method is based on identifying the 
presence of characterizing DNA sequences or amino acids encoded by such DNA 
5 sequences in the transgenic plant, cells or tissues. According to a preferred embodiment 
of the invention, such characterizing DNA sequences are sequences of 15bp, preferably 
20bp, most preferably 30bp or more which comprise the insertion site of the event, i.e. 
both a part foreign DNA and a part of the cotton genome (either the 5' or 3* flanking 
region) contiguous therewith, allowing specific identification of the elite event. 

10 

According to another aspect of the invention, DNA sequences are disclosed comprising 
the insertion site of the event and sufficient length of polynucleotides of both the cotton 
genomic DNA and the foreign DNA (transgene), so as to be useful as primer or probe for 
the detection of EE-GH1. Preferably, such sequences comprise at least 9 nucleotides of 
15 the cotton genomic DNA and a similar number of nucleotides of the foreign DNA 
(transgene) of EE-GH1 therewith at each side of the insertion site respectively. Most 
preferably, such DNA sequences comprise at least 9 nucleotides of the cotton genomic 
DNA and a similar number of nucleotides of the foreign DNA contiguous with the 
insertion site in SEQ ID NO: 1 or SEQ ID NO: 4. 

20 

According to another preferred aspect of the invention, the method for identifying a 
transgenic plant, or cells or tissues thereof, comprising elite event EE-GH1, comprises 
amplifying a sequence of a nucleic acid present in biological samples, using a polymerase 
chain reaction, with at least two primers, one of which recognizes the plant DNA in the 5' 

25 or 3' flanking region of EE-GH1, the other which recognizes a sequence within the 
foreign DNA. Preferably, the genomic DNA is analyzed using primers which recognize a 
sequence within the plant 5' flanking region of EE-GH1, most preferably within the plant 
DNA sequence in SEQ ID NO: 1 , and a sequence within the foreign DNA, respectively. 
Especially preferably, the genomic DNA is analyzed according to the PCR identification 

30 protocol described herein whereby the primer recognizing a sequence within the 5' 
flanking region comprises the nucleotide sequence of SEQ ID NO: 2. Particularly, the 
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primer recognizing a sequence within the 5' flanking region comprises the nucleotide 
sequence of SEQ ID NO: 2 and the primer recognizing a sequence within the foreign 
DNA comprises the nucleotide sequence of SEQ ID NO: 3, so that the amplified 
fragment is a fragment preferably of between 250 and 290 bp, more preferably of 
5 between 260 and 270 bp, most preferably of about 269 bp. 

Accordingly, the present invention relates to the transgenic plant, cells or tissues thereof 
which can be identified according the above-described identification methods for EE- 
GH1. 

10 

The present invention relates to methods for identifying elite event EE-GH1 in biological 
samples, which methods are based on primers or probes that specifically recognize the 5' 
and/or 3' flanking sequence of EE-GH1. In a preferred embodiment of the invention 
these methods are based on primers or probes which recognize a sequence within SEQ ID 

15 NO: 1 and/or SEQ ID NO: 4, more particularly primers or probes comprising the 
sequence of SEQ ID NO: 2. According to another embodiment of the invention such 
methods are based on primers or probes which recognize a characterizing sequence of 
EE-GH1, which is a DNA sequence comprising the insertion site of the event, including 
sufficient nucleotides of the 5' or 3' flanking DNA and the foreign DNA contiguous 

20 therewith so as to be diagnostic for the event. Most particularly, such methods are based 
on sequences comprising sufficient nucleotides of the 5' or 3' flanking DNA and the 
foreign DNA contiguous therewith of SEQ ID NO: 1 or SEQ ID NO: 4. Such methods 
are known in the art and include, but are not limited to, the method described by 
Lyamichev et al. (1999, Nature Biotechn 17:292-296). 

25 

The present invention further relates to the specific flanking sequences of EE-GH1 
described herein, which can be used to develop specific identification methods for EE- 
GH1 in biological samples. More particularly, the invention relates to the 5' and or 3' 
flanking regions of EE-GH1, which can be used for the development of specific primers 
30 and probes as well as to the specific primers and probes developed from the 5' and/or 3' 
flanking sequences of EE-GH1 and the insertion site adjacent thereto. The invention 
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further relates to identification methods for the presence of EE-GH1 in biological 
samples based on the use of such specific primers or probes. 

The invention thus also relates to a kit for identifying elite event EE-GH1 in biological 
5 samples, the kit comprising at least one primer or probe which specifically recognizes the 
5' or 3' flanking region of EE-GH1. 

Preferably the kit of the invention comprises, in addition to a primer which specifically 
recognizes the 5' or 3' flanking region of EE-GH1, a second primer which specifically 

10 recognizes a sequence within the foreign DNA of EE-GH1, for use in a PCR 
identification protocol. Preferably, the kit of the invention comprises two (or more) 
specific primers, one of which recognizes a sequence within the 5' flanking region of EE- 
GH1, most preferably a sequence within the plant DNA region of SEQ ID NO: 1, and an 
other which recognizes a sequence within the foreign DNA. Especially preferably, the 

15 primer recognizing the plant DNA sequence within 5' flanking region comprises the 
nucleotide sequence of SEQ ID NO: 2. Particularly, the primer recognizing the plant 
DNA sequence within 5' flanking region comprises the nucleotide sequence of SEQ ID 
NO: 2 and the primer recognizing the foreign DNA comprises the nucleotide sequence of 
SEQ ID NO: 1 described herein. 

20 

According to a further embodiment, the invention relates to a kit for identifying elite 
event EE-GH1 in biological samples, which kit comprises at least one specific primer or 
probe having a sequence which corresponds (or is complementary to) a sequence that 
hybridizes under stringent conditions to a specific or characterizing region of EE-GH1. 

25 Preferably the sequence of the probe corresponds to a specific region comprising part of 
the foreign DNA and part of the 5' or 3' flanking region (plant DNA) of EE-GH1 
contiguous therewith. Most preferably the specific probe comprises (or is complementary 
to) a sequence that hybridizes under stringent conditions to a sequence of 15 to 30 
nucleotides spanning the insertion site at the 5' or 3' region of the foreign DNA within 

30 SEQ ID NO: 1 or SEQ ID NO: 4. Especially preferably the specific probe comprises (or 
is complementary to) a sequence that hybridizes under stringent conditions to a sequence 
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of 9 nucleotides of plant DNA and a similar number of nucleotides of the foreign DNA 
contiguous with the insertion site, within the sequences of SEQ ID NO:l or SEQ ID 
NO:4. According to a particular embodiment, such a primer comprises a sequence that is 
identical to a sequence of 9 nucleotides of plant DNA and a similar number of 
5 nucleotides of the foreign DNA contiguous with the insertion site, within the sequences 
of SEQ ID NO: 1 or SEQ ID NO: 4. 

The methods and kits encompassed by the present invention can be used for different 
purposes such as, but not limited to the following: to identify EE-GH1 in plants, plant 

10 material or in products such as, but not limited to food or feed products (fresh or 
processed) comprising or derived from plant material; additionally or alternatively, the 
methods and kits of the present invention can be used to identify transgenic plant material 
for purposes of segregation between transgenic and non-transgenic material; additionally 
or alternatively, the methods and kits of the present invention can be used to determine 

15 the quality (i.e. percentage pure material) of plant material comprising EE-GH1. 

The present invention further relates to a method for tracking plants comprising elite 
event EE-GH1 in their genome upon introduction into different cultivars. 

20 It will be understood that particular embodiments of the invention are described by the 
dependent claims cited herein. 

Brief description of the Figure 

25 

The following detailed description, given by way of example, but not intended to limit 
the invention to specific embodiments described, may be understood in conjunction with 
the accompanying Figure, incorporated herein by reference, in which: 

30 
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Fig. 1. PCR analysis of other events and elite event EE-GH1 using the EE-GH1 PCR 
identification protocol. Loading sequence of the gel: lane 1, molecular weight 
marker (lOObp ladder), lane 2, DNA sample from a cotton plant comprising the 
transgenic event EE-GH1, lane 3, DNA samples from a cotton plant comprising 
5 another transgenic event, lane 4, DNA from wild-type cotton, lane 5, wild-type + 

1 copy of the pGSV71-BamHI digest (positive control), lane 6, negative control 
(no template), lane 8, molecular weight marker (lOObp ladder). 

10 Detailed Description of Preferred Embodiments 

The term "gene" as used herein refers to any DNA sequence comprising several operably 
linked DNA fragments such as a promoter region, a 5' untranslated region (the 5'UTR), a 
coding region (which may or may not code for a protein), and an untranslated 3' region 
15 (3'UTR) comprising a polyadenylation site. Typically in plant cells, the 5'UTR, the 
coding region and the 3'UTR are transcribed into an RNA of which, in the case of a 
protein encoding gene, the coding region is translated into a protein. A gene may include 
additional DNA fragments such as, for example, introns. As used herein, a genetic locus 
is the position of a given gene in the genome of a plant. 

20 

The term "chimeric" when referring to a gene or DNA sequence is used to refer to the 
fact that the gene or DNA sequence comprises at least two functionally relevant DNA 
fragments (such as promoter, 5'UTR, coding region, 3'UTR, intron) that are not naturally 
associated with each other and/or originate, for example, from different sources. 

25 "Foreign" referring to a gene or DNA sequence with respect to a plant species is used to 
indicate that the gene or DNA sequence is not naturally found in that plant species, or is 
not naturally found in that genetic locus in that plant species. The term "foreign DNA" 
will be used herein to refer to a DNA sequence as it has incorporated into the genome of 
a plant as a result of transformation. The "transforming DNA" as used herein refers to a 

30 recombinant DNA molecule used for transformation. The transforming DNA usually 
comprises at least one "gene of interest" (e.g. a chimeric gene) which is capable of 
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conferring one or more specific characteristics to the transformed plant. The term 
"recombinant DNA molecule" is used to exemplify and thus can include an isolated 
nucleic acid molecule which can be DNA and which can be obtained through 
recombinant or other procedures. 

5 

As used herein the term "transgene" refers to a gene of interest (or the whole of the 
foreign DNA) as incorporated in the genome of a plant. A "transgenic plant" refers to a 
plant comprising at least one transgene in the genome of all of its cells. 

10 The foreign DNA present in the plants of the present invention will preferably comprise a 
herbicide tolerance gene, more specifically a 35S-bar gene as the gene of interest. 

A "herbicide tolerance" gene as used herein refers to a gene that renders the plant tolerant 
to a herbicide. An example of a herbicide tolerance gene is gene comprising a sequence 

15 encoding the enzyme phosphinothricin acetyl transferase, which detoxifies 
phosphinothricin, under the control of a constitutive promoter. More specifically, in the 
elite event of the present invention the herbicide tolerance gene comprises the coding 
sequence of the bialaphos resistance gene (bar) of Streptomyces hygroscopicus 
(Thompson et al (1987) EMBO J 6: 2519-2523) under control of the 35S promoter from 

20 Cauliflower Mosaic Virus (Odell et al., (1985), Nature 313: 810-812), also referred to as 
"35S-6ar" herein. The expression of the 35S-6ar gene confers tolerance to herbicidal 
compounds phosphinothricin or bialaphos or glufosinate, or more generally, glutamine 
synthase inhibitors, or salts or optical isomers thereof which will generally be referred to 
as "glufosinate tolerance" herein. 

25 By hybridizing under "stringent conditions" is meant the conventional hybridizing 
conditions as described by Sambrook et al. (1989) (Molecular Cloning: A Laboratory 
Manual, Second Edition, Cold Spring Harbour Laboratory Press, NY) which for instance 
can comprise the following steps: 1) immobilizing plant genomic DNA fragments on a 
filter, 2) prehybridizing the filter for 1 to 2 hours at 42°C in 50% formamide, 5 X SSPE, 

30 2 X Denhardt's reagent and 0.1% SDS, or for 1 to 2 hours at 68°C in 6 X SSC, 2 X 
Denhardt's reagent and 0.1% SDS, 3) adding the hybridization probe which has been 
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labeled, 4) incubating for 16 to 24 hours, 5) washing the filter for 20 min. at room 
temperature in IX SSC, 0.1 %SDS, 6) washing the filter three times for 20 min. each at 
68°C in 0.2 X SSC, 0.1 %SDS, and 7) exposing the filter for 24 to 48 hours to X-ray film 
at -70°C with an intensifying screen. 

5 

The incorporation of a recombinant DNA molecule in the plant genome typically results 
from transformation of a cell, tissue or callus (or from another genetic manipulation). The 
particular site of incorporation is either random or is at a predetermined location (if a 
process of targeted integration is used). 

10 

The DNA introduced into the plant genome as a result of transformation of a plant cell or 
tissue with a recombinant DNA or "transforming DNA" is hereinafter referred to as 
"foreign DNA" comprising one or more "transgenes". Thus, foreign DNA may comprise 
both recombinant DNA as well as newly introduced, rearranged DNA of the plant. 
15 However, the term "plant DNA" in the context of the present invention will refer to DNA 
of the plant which is generally found in the same genetic locus in the corresponding wild- 
type plant. 

The foreign DNA can be characterized by the location and the configuration at the site of 
incorporation of the recombinant DNA molecule in the plant genome. The site in the 

20 plant genome where a recombinant DNA has been inserted is also referred to as the 
"insertion site" or "target site". Insertion of the recombinant DNA into the plant genome 
can be associated with a deletion of plant DNA, referred to as "target site deletion". A 
"flanking region" or "flanking sequence" as used herein refers to a sequence of at least 20 
bp, preferably at least 50 bp, and up to 5000 bp of the plant genome which is located 

25 either immediately upstream of and contiguous with or immediately downstream of and 
contiguous with the foreign DNA. Transformation procedures leading to random 
integration of the foreign DNA will result in transformants with different flanking 
regions, which are characteristic and unique for each transformant. When the 
recombinant DNA present in a transgenic plant is introduced into a different plant 

30 through traditional crossing, its insertion site in the plant genome, or its flanking regions 
will generally not be changed (apart from occasional changes due to mutations or cross- 
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over and transposons). An "insertion region" as used herein refers to the region 
corresponding to the region of at least 40 bp, preferably at least 100 bp, and up to more 
than 10000 bp, encompassed by the sequence which comprises the upstream and/or the 
downstream flanking region of a foreign DNA in the (untransformed) plant genome (and 
5 including the insertion site and possible target site deletion). Taking into consideration 
minor differences due to mutations within a species, an insertion region will retain at least 
85%, preferably 90%, more preferably 95%, and most preferably 100% sequence identity 
with the sequence comprising the upstream and downstream flanking regions of the 
foreign DNA in a given plant of that species. 

10 

Expression of a gene of interest refers to the fact that the gene confers on the plant one or 
more phenotypic traits (e.g. herbicide tolerance) that were intended to be conferred by the 
introduction of the recombinant DNA molecule - the transforming DNA - used during 
transformation (on the basis of the structure and function of part or all of the gene(s) of 
15 interest). 

An "event" is defined as a (artificial) genetic locus that, as a result of genetic engineering, 
carries a foreign DNA comprising at least one copy of the gene(s) of interest (also 
referred to as a transformation event). An event is characterized phenotypically by the 

20 expression of the transgenes. At the genetic level, an event is part of the genetic makeup 
of a plant. At the molecular level, an event is characterized by the restriction map (e.g. as 
determined by Southern blotting) and/or by the upstream and/or downstream flanking 
sequences of the foreign DNA, and/or the molecular configuration of the foreign DNA 
comprising the transgenes. Usually when transforming a plant cell, tissue or callus with a 

25 transforming DNA, a multitude of events are generated, each of which is unique. 

An "elite event", as used herein, is an event which is selected from a group of events, 
obtained by transformation with the same transforming DNA or by back-crossing with 
plants obtained by such transformation, based on the phenotypic expression and stability 
30 of the transgenes and the absence of negative impact on the agronomic characteristics of 
the plant comprising it (i.e., selected transformation event). Thus the criteria for elite 
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event selection are one or more, preferably two or more, advantageously all of the 
following: 

a) That the presence of the foreign DNA in the plant does not compromise other desired 
characteristics of the plant, such as those relating to agronomic performance or 

5 commercial value; 

b) That the event is characterized by a well defined molecular configuration which is 
stably inherited and for which appropriate diagnostic tools for identity control can be 
developed; 

c) That the gene(s) of interest show(s) an appropriate and stable spatial and temporal 
10 phenotypic expression in homozygous condition of the event, at a commercially 

acceptable level in a range of environmental conditions in which the plants carrying the 
event are likely to be exposed in normal agronomic use. 

It is preferred that the foreign DNA is associated with a position in the plant genome that 
allows introgression into desired commercial genetic backgrounds. 

15 

The status of an event as an elite event is confirmed by introgression of the elite event in 
different relevant genetic backgrounds and observing compliance with one, two or all of 
the criteria e.g. a), b) and c) above. 

20 An "elite event" thus refers to a genetic locus comprising a foreign DNA, which answers 
to the above-described criteria. A plant, plant material or progeny such as seeds can 
comprise one or more elite events in its genome. Thus, when referring to a plant, seed cell 
or tissue comprising elite event EE-GH1 in its genome, a plant, seed cell or tissue is 
intended which comprises the foreign DNA described herein (comprising the 35S-bar 

25 gene) integrated in its genome at the integration site described herein. 

The tools developed to identify an elite event or the plant or plant material comprising an 
elite event, or products which comprise plant material comprising the elite event are 
based on the specific genomic characteristics of the elite event, such as, a specific 
30 restriction map of the genomic region comprising the foreign DNA, molecular markers or 
the sequence of the flanking region(s) of the foreign DNA. 
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Once one or both of the flanking regions of the foreign DNA have been sequenced, 
primers and probes can be developed which specifically recognize this (these) 
sequence(s) in the nucleic acid (DNA or RNA) of a sample by way of a molecular 

5 biological technique. For instance a PCR method can be developed to identify the elite 
event in biological samples (such as samples of plants, plant material or products 
comprising plant material). Such a PCR is based on at least two specific "primers" one 
recognizing a sequence within the 5' or 3' flanking region of the elite event and the other 
recognizing a sequence within the foreign DNA. The primers preferably have a sequence 

10 of between 15 and 35 nucleotides which under optimized PCR conditions "specifically 
recognize" a sequence within the 5' or 3' flanking region of the elite event and the 
foreign DNA of the elite event respectively, so that a specific fragment ("integration 
fragment") is amplified from a nucleic acid sample comprising the elite event. This 
means that only the targeted integration fragment, and no other sequence (of that size) in 

15 the plant genome or foreign DNA, is amplified under optimized PCR conditions. 

Preferably, the integration fragment has a length of between 50 and 500 nucleotides, most 
preferably of between 100 and 350 nucleotides. Preferably the specific primers have a 
sequence which is between 80 and 100% identical to a sequence within the 5' or 3' 
flanking region of the elite event and the foreign DNA of the elite event, respectively, 

20 provided the mismatches still allow specific identification of the elite event with these 
primers under optimized PCR conditions. The range of allowable mismatches however, 
can easily be determined experimentally and are known to a person skilled in the art. 
As the sequence of the primers and their recognized sequence in the genome are unique 
for the elite event, amplification of the integration fragment will occur only in biological 

25 samples comprising (the nucleic acid of) the elite event. Preferably when performing a 
PCR to identify the presence of EE-GH1 in unknown samples, a control is included of a 
set of primers with which a fragment within a "housekeeping gene" of the plant species 
of the event can be amplified. Housekeeping genes are genes that are expressed in most 
cell types and which are concerned with basic metabolic activities common to all cells. 

30 Preferably, the fragment amplified from the housekeeping gene is a fragment which is 
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larger than the amplified integration fragment. Depending on the samples to be analyzed, 
other controls can be included. 

Standard PCR protocols are described in the art, such as in 'PCR Applications Manual" 
5 (Roche Molecular Biochemicals, 2nd Edition, 1999). The optimal conditions for the 
PCR, including the sequence of the specific primers, is specified in a "PCR identification 
protocol" for each elite event. It is however understood that a number of parameters in 
the PCR identification protocol may need to be adjusted to specific laboratory conditions, 
and may be modified slightly to obtain similar results. For instance, use of a different 
10 method for preparation of DNA may require adjustment of, for instance, the amount of 
primers, polymerase and annealing conditions used. Similarly, the selection of other 
primers may dictate other optimal conditions for the PCR identification protocol. These 
adjustments will however be apparent to a person skilled in the art, and are furthermore 
detailed in current PCR application manuals such as the one cited above. 

15 

Alternatively, specific primers can be used to amplify an integration fragment that can be 
used as a "specific probe" for identifying EE-GH1 in biological samples. Contacting 
nucleic acid of a biological sample, with the probe, under conditions which allow 
hybridization of the probe with its corresponding fragment in the nucleic acid, results in 

20 the formation of a nucleic acid/probe hybrid. The formation of this hybrid can be detected 
(e.g. labeling of the nucleic acid or probe), whereby the formation of this hybrid indicates 
the presence of EE-GH1. Such identification methods based on hybridization with a 
specific probe (either on a solid phase carrier or in solution) have been described in the 
art. The specific probe is preferably a sequence which, under optimized conditions, 

25 hybridizes specifically to a region comprising part of the 5' or 3' flanking region of the 
elite event and also comprising part of the foreign DNA contiguous therewith (hereinafter 
also referred to as a "specific region" of the event). Depending on the method used, such 
a specific probe can comprise a sequence of between 15 and 30 bp or of between 50 and 
500 bp, preferably of 100 to 350 bp which hybridizes under stringent conditions to the 

30 nucleotide sequence (or the complement of such sequence) of a specific region. 
Preferably, the specific probe will comprise a sequence of about 15 to about 100 
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contiguous nucleotides identical (or complementary) to a specific region of the elite 
event. Most preferably such a probe comprises at least 9 nucleotides identical to (or 
complementary to the 3' or 5' flanking region and a similar number of nucleotides in the 
foreign DNA contiguous therewith. Most preferably, such a probe comprises at least 9 
5 nucleotides identical to (or complementary to the 3' or 5' flanking region and a similar 
number of nucleotides in the foreign DNA contiguous therewith of SEQ ID NO: 1 or 
SEQ ID NO: 4. 

A "restriction map" as used herein refers to a set of Southern blot patterns obtained after 
10 cleaving plant genomic DNA (and/or the foreign DNA comprised therein) with a 
particular restriction enzyme, or set of restriction enzymes and hybridization with a probe 
sharing sequence similarity with the foreign DNA under standard stringency conditions. 
Standard stringency conditions as used herein refers to the conditions for hybridization 
described herein or to the conventional hybridizing conditions as described by Sambrook 
15 et al (1989) (Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring 
Harbour Laboratory Press, NY) which for instance can comprise the following steps: 1) 
immobilizing plant genomic DNA fragments on a filter, 2) prehybridizing the filter for 1 
to 2 hours at 42°C in 50% formamide, 5 X SSPE, 2 X Denhardt's reagent and 0.1% SDS, 
or for 1 to 2 hours at 68°C in 6 X SSC, 2 X Denhardt's reagent and 0.1% SDS, 3) adding 
20 the hybridization probe which has been labeled, 4) incubating for 16 to 24 hours, 5) 
washing the filter for 20 min. at room temperature in IX SSC, 0.1 %SDS, 6) washing the 
filter three times for 20 min. each at 68°C in 0.2 X SSC, 0.1 %SDS, and 7) exposing the 
filter for 24 to 48 hours to X-ray film at -70°C with an intensifying screen. 

25 Due to the (endogenous) restriction sites present in a plant genome prior to incorporation 
of the foreign DNA, insertion of a foreign DNA will alter the specific restriction map of 
that genome. Thus, a particular transformant or progeny derived thereof can be identified 
by one or more specific restriction patterns. 

30 Alternatively, plants or plant material comprising an elite event can be identified by 
testing according to a PCR identification protocol. This is a PCR which specifically 
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recognizes the elite event. Essentially, a set of PCR primers is developed which 
recognizes a) a sequence within the 3' or 5' flanking sequence of the elite event and b) a 
sequence within the foreign DNA, which primers amplify a fragment (integration 
fragment) preferably of between 100 and 300 nucleotides. Preferably, a control is 
5 included of a set of primers which amplifies a fragment within a housekeeping gene of 
the plant species (preferably a fragment which is larger than the amplified integration 
fragment). The optimal conditions for the PCR, including the sequence of the specific 
primers is specified in a PCR identification protocol. 

10 Other methods for identifying plants, plant material or products comprising plant material 
comprising elite event EE-GH1 are also envisaged. These methods include all methods 
based on the detection of the foreign DNA sequence and flanking sequence(s) of the elite 
event with a specific probe. More particularly, chip-based technologies, such as those 
described by Hacia et al 1996 (Nat Genet 1 4(4) :44 1-447) and Shoemaker et al 1996 

15 (Nat Genet 14(4):450-456) are envisaged. These methods allow segregation of target 
molecules as high-density arrays by using fixed probe arrays or by tagging of the genes 
with oligonucleotides, after which they can be screened by hybridization. Alternatively, 
third wave detection methods, such as those described by Lyamichev et al (1999, above) 
are envisaged. 

20 

Identification of the protein(s) encoded by the foreign DNA of the elite event can be done 
by classical protein detection methods described in the art, such as those based on 
chromatographic or electromagnetic properties of the protein or the detection by specific 
monoclonal antibodies (as described in "Guide to protein purification, Murray P. 
25 Deutscher editor). 

A "kit" as used herein refers to a set of reagents for the purpose of performing the method 
of the invention, more particularly, the identification of the elite event EE-GH1 in 
biological samples. More particularly, a preferred embodiment of the kit of the invention 
30 comprises at least one or two specific primers, as described above. Optionally, the kit can 
further comprise any other reagent described herein in the PCR identification protocol. 
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Alternatively, according to another embodiment of this invention, the kit can comprise a 
specific probe, as described above, which specifically hybridizes with nucleic acid of 
biological samples to identify the presence of EE-GH1 therein. Optionally, the kit can 
further comprise any other reagent (such as but not limited to hybridizing buffer, label) 
5 for identification of EE-GH1 in biological samples, using the specific probe. 

The kit of the invention can be used, and its components can be specifically adjusted, for 
purposes of quality control (e.g., purity of seed lots), detection of the elite event in plant 
material or material comprising or derived from plant material, such as but not limited to 
10 food or feed products. 

The present invention relates to the development of an elite event in cotton, EE-GH1, to 
the plants comprising this event, the progeny obtained from these plants and to the plant 
cells, or plant material derived from this event. Plants comprising elite event EE-GH1 
15 were obtained through transformation with pGSV71 as described in example 1. 

Cotton plants or plant material comprising EE-GH1 can be identified according to the 
PCR identification protocol described for EE-GH1 in Example 4 herein. Briefly, cotton 
genomic DNA is amplified by PCR using a primer which specifically recognizes a 

20 sequence within the 5' or 3' flanking sequence of EE-GH1, particularly the primer with 
the sequence of SEQ ID NO: 2, and a primer which recognizes a sequence in the foreign 
DNA, particularly the primer with the sequence of SEQ ID NO: 3. Endogenous cotton 
DNA primers are used as controls. If the plant material yields a fragment of between 250 
and 290 bp, preferably of about 269 bp, the cotton plant is determined to harbor elite 

25 event EE-GH1. 

Plants harboring EE-GH1 are characterized by their glufosinate tolerance, which in the 
context of the present invention includes that plants are tolerant to the herbicide 
Liberty™. Tolerance to Liberty™ can be tested in different ways. The leaf paint method 
30 as described herein, is most useful when you wish to identify both resistant and sensitive 
plants, but do not want to kill the sensitive ones. Alternatively, tolerance can be tested by 
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Liberty™ spray application. Spray treatments should be made between the leaf stages V3 
and V4 for best results. Tolerant plants are characterized by the fact that spraying of the 
plants with at least 200 grams active ingredient/hectare (g.a.i./ha), preferably 400 
g.a.i./ha, and possibly up to 1600 g.a.i./ha (4X the normal field rate), does not kill the 
5 plants. A broadcast application should be applied at a rate of 28-34 oz Liberty™. It is 
best to apply at a volume of 20 gallons of water per acre using a flat fan type nozzle while 
being careful not to direct spray applications directly into the whorl of the plants to avoid 
surfactant burn on the leaves. The herbicide effect should appear within 48 hours and be 
clearly visible within 5-7 days. 

10 

Plants harboring EE-GH1 can further be characterized by the presence in their cells of 
phosphinothricin acetyl transferase as determined by a PAT assay (De Block et al, 1987, 
supra). 

15 Plants harboring EE-GH1 can, for example, be obtained from seeds deposited at the 
ATCC under accession number PTA-3343, which contain 50% kernels that are 
hemizigous for the elite event. Such plants can be further propagated to introduce the elite 
event of the invention into other cultivars of the same plant species. Selected seeds 
obtained from these plants contain the elite event stably incorporated into their genome. 

20 The invention further relates to plants derived from the ATCC accession number PTA- 
334, comprising EE-GH1. The term "derived from" herein indicates that the plants are 
related, i.e. they are both progeny (direct or of two or more generations) of the same 
transformant by crossing. 

Plants harboring EE-GH1 are also characterized by having agronomical characteristics 
25 that are comparable to commercially available varieties of cotton in the US, in the 
absence of weed pressure and use of Liberty™ for weed control. It has been observed 
that the presence of a foreign DNA in the insertion region of the cotton plant genome 
described herein, confers particularly interesting phenotypic and molecular characteristics 
to the plants comprising this event. More specifically, the presence of the foreign DNA in 
30 this particular region in the genome of these plants, results in plants which display a 
stable phenotypic expression of the gene of interest without significantly compromising 
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any aspect of desired agronomic performance of the plants. Thus, the insertion region, 
corresponding to a sequence comprising the plant DNA of SEQ ID NO: 1 and/or SEQ ID 
NO: 4, more particularly a sequence corresponding to SEQ ID NO: 5, most particularly 
the insertion site of EE-GH1 therein, is shown to be particularly suited for the 

5 introduction of a gene(s) of interest. More particularly, the insertion region of EE-GH1 
(corresponding to a DNA sequence of at least 40 bp in the cotton genome within SEQ ID 
NO: 5), or a sequence of at least 40 bp which hybridizes under stringent conditions to the 
complement of the sequence of SEQ ID NO: 5, is particularly suited for the introduction 
of foreign DNA comprising a herbicide tolerance gene, ensuring expression of each of 

10 these genes in the plant without compromising agronomic performance. 

A recombinant DNA molecule can be specifically inserted in an insertion region by 
targeted insertion methods. Such methods are well known to those skilled in the art and 
comprise, for example, homologous recombination using a recombinase such as, but not 

15 limited to the FLP recombinase from Saccharomyces cerevisiae (published PCTP 
application WO 99/25821), the CRE recombinase from Escherichia coli phage PI 
(published PCT application WO 99/25840), the recombinase from pSRl of 
Saccharomyces rouxii (Araki et al 1985, J Mol Biol 182:191-203), the Gin/gix system of 
phage Mu (Maeser and Kahlmann, 1991, Mol Gen Genetics 230:170-176) or the lambda 

20 phage recombination system (such as described in US Patent 4,673,640). 

As used herein, "sequence identity" with regard to nucleotide sequences (DNA or RNA), 
refers to the number of positions with identical nucleotides divided by the number of 
nucleotides in the shorter of the two sequences. The alignment of the two nucleotide 

25 sequences is performed by the Wilbur and Lipmann algorithm (Wilbur and Lipmann, 
1983, Proc Nat Acad Sci USA 80:726) using a window-size of 20 nucleotides, a word 
length of 4 nucleotides, and a gap penalty of 4. Computer-assisted analysis and 
interpretation of sequence data, including sequence alignment as described above, can, 
e.g., be conveniently performed using the programs of the Wisconsin Package (from the 

30 Genetics Computer Group, Inc). Sequences are indicated as "essentially similar" when 
such sequences have a sequence identity of at least about 75%, particularly at least about 
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80%, more particularly at least about 85%, quite particularly about 90%, especially about 
95%, more especially about 100%, quite especially are identical. It is clear that when 
RNA sequences are said to be essentially similar or have a certain degree of sequence 
identity with DNA sequences, thymine (T) in the DNA sequence is considered equal to 
5 uracil (U) in the RNA sequence. "Complementary to" as used herein refers to the 
complementarity between the A and T (U), and G and C nucleotides in nucleotide 
sequences. 

As used in herein, a "biological sample" is a sample of a plant, plant material or products 
10 comprising plant material. The term "plant" is intended to encompass cotton (such as but 
not limited to Gossypium hirsutum) plant tissues, at any stage of maturity, as well as any 
cells, tissues, or organs taken from or derived from any such plant, including without 
limitation, any seeds, leaves, stems, flowers, roots, single cells, gametes, cell cultures, 
tissue cultures or protoplasts. "Plant material", as used herein refers to material which is 
15 obtained or derived from a plant. Products comprising plant material relate to food, feed 
or other products which are produced using plant material or can be contaminated by 
plant material. It is understood that, in the context of the present invention, such 
biological samples are preferably tested for the presence of nucleic acids specific for EE- 
GH1, implying the presence of nucleic acids in the samples. Thus the methods referred to 
20 herein for identifying elite event EE-GH1 in biological samples, preferably relate to the 
identification in biological samples of nucleic acids which comprise the elite event. 

As used herein "comprising" is to be interpreted as specifying the presence of the stated 
features, integers, steps or components as referred to, but does not preclude the presence 

25 or addition of one or more features, integers, steps or components, or groups thereof. 
Thus, e.g., a nucleic acid or protein comprising a sequence of nucleotides or amino acids, 
may comprise more nucleotides or amino acids than the actually cited ones, i.e., be 
embedded in a larger nucleic acid or protein. A chimeric gene comprising a DNA 
sequence which is functionally or structurally defined, may comprise additional DNA 

30 sequences, etc. 
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The following examples describe the development and characteristics of cotton plants 
harboring the elite events EE-GH1 as well as the development of tools for the 
identification of elite event EE-GH1 in biological samples. 

5 Unless otherwise stated, all recombinant DNA techniques are carried out according to 
standard protocols as described in Sambrook et al. (1989) Molecular Cloning: A 
Laboratory Manual, Second Edition, Cold Spring Harbour Laboratory Press, NY and in 
Volumes 1 and 2 of Ausubel et al (1994) Current Protocols in Molecular Biology, 
Current Protocols, USA. Standard materials and methods for plant molecular work are 

10 described in Plant Molecular Biology Labfax (1993) by R.D.D. Croy published by BIOS 
Scientific Publications Ltd (UK) and Blackwell Scientific Publications, UK. 

In the description and examples, reference is made to the following sequences: 





SEQ ID NO: 1 : 


sequence comprising the 5' flanking region 


15 


SEQ ID NO: 2: 


primer GHI06 




SEQ ID NO: 3: 


primer GHI05 




SEQ ID NO: 4: 


sequence comprising the 3' flanking region 




SEQ ID NO: 5: 


insertion region 




SEQ ID NO: 6: 


plasmid pGSV71 


20 


SEQ ID NO: 7: 


plasmid pRVA44 




SEQ ID NO: 8 


primer MDB327 




SEQ ID NO: 9: 


primer MLD015 




SEQ ID NO: 10 


primer MLD016 




SEQ ID NO: 1 1 


primer MDB6 12 


25 


SEQ ID NO: 12 


primer MDB053 




SEQ ID NO: 13 


primer MDB356 




SEQ ID NO: 14 


primer DP AO 17 




SEQ ID NO: 15 


: primer MLD01 9 




SEQ ID NO: 16 


: sequence comprising target site deletion 


30 


SEQ ID NO: 17 


primer GHI01 




SEQ ID NO: 18 


: primer GHI02 



22 



WO 03/013224 



PCT/EP02/08136 



EXAMPLES 

Example 1 : Transformation of cotton with a herbicide tolerance gene 

5 1.1. Construction of the chimeric DNA comprising the bar gene under the control of 
a constitutive promoter 

A plasmid pGSV71 was constructed following standard procedures. The sequence of the 
genetic elemennts of plasmid pGSV71 is given in table 1 (SEQ ID NO: 6): 

10 



Table 1 . Nucleotide positions of the genetic elements in pGSV71 



Nt positions 


Abbreviation 


Description and references 


198-222 




Right border Repeat from the TL-DNA from pTiB6S3 
(Gielen et aL, 1984, EMBO J. 3:835-846) 


223-249 




Polylinker 


250-1634 ! 


P35S3 


Promoter of the 35 S RNA of Cauliflower Mosaic Virus 
(Odell et at., (1985), Nature 313: 810-812) 


1635-2186 


bar 


Coding sequence encoding phosphinothricine-acetyl- 
transferase from Streptomyces hygroscopicus (Thompson et 
al., (1987) EMBO J. 6: 2519-2523). The N-terminal two 
codons of the wild-type bar coding region have been 
substituted for codons ATG and GAC respectively. 


2187-2205 




Polylinker 


2206-2465 


3 'nos 


A 260 bp TaqI fragment from the 3' untranslated region of 
the nopaline-synthase gene originating from the T-DNA of 
pTiT37 (Depicker et aL, 1982, J. Mol. Appl. Genet. 1: 561- 
573 ) 


2466-2519 




Polylinker 


2520-2544 




Left border Repeat from the TL-DNA from pTiB6S3 (Gielen 
et a/., 1984, EMBO J. 3:835-846) 



1.2. Transformation of Gossypium hirsutum 

15 

Cotton tissue from Coker312 plants was transformed with pGSV71 using Agrobacterium 
transformation (US 5,986,181) and regenerated to plants on appropriate media. 
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The small plantlets initiated on the selective regeneration media were transferred to new 
medium for germination (all medium is hormone-free). Plantlets were then transferred to 
the growth chambers or to the greenhouses. 

5 Selection was done orv phosphinothricin (PPT) at all stages except plantlet regeneration, 
which was done in the absence of PPT to accelerate growth. This resulted in a set of 
primary transformants (plants of generation TO). 

10 Example 2 : Development of events 

2.1. Development of lines carrying the event trait 

TO shoots were transferred to greenhouse soil and plants were screened for glufosinate 
15 tolerance and for the presence of the PAT enzyme with a PAT ELISA (Steffens 
Biotechnische Analysen GmbH, Ebringen, Germany). 

Tl to T3 plants were grown in the greenhouse and tested for Liberty™ tolerance at a 2x 
rate (by spraying 56 oz/ha). Positive plants were tested for expression of the bar gene 
20 using the Pat assay as described by Deblock et al. 1987 (EMBO J. 6:2513-2518). 

Presence of the foreign DNA and copy number was checked by Southern blot analysis. 
Total genomic DNA was isolated from lg of leaf tissue according to the CETAB method 
of Doyle et al. (1987, Phytochem. Bull. 19:11) and digested with EcoRI restriction 
25 enzyme. 

Probes such as the following were used for Southern analysis : 
"bar" probe: 474 bp KpnI-Bgll digest of plasmid pDEl 10 (WO 92/09696) 
"35S" probe: 892 bp NcoI-MunI digest of plasmid pRVA44 (SEQ ID NO: 7) 
T2 Plants were also evaluated for general phenotypic characteristics compared to the non- 
30 transgenic isogenic lines. In later generations, the lines for which no negative penalties on 
phenotype or agronomic performance was observed for the presence of the transgene 
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either in hemizygous or in homozygous condition, as compared to wild-types were 
selected. 

T4 material was grown in the field and tested under field conditions for Liberty™ 
5 tolerance according to different schedules. 

In later generations, plants were compared to commercial varieties for yield, fiber quality 
and plant mapping data. Agronomic characteristics, such as plant height, height to node, 
boll retention, stand, vigor, fiber length, fiber strength and lint yield were evaluated. 

10 

It was determined that one event performed equally or better than the comparable checks 
and that for this event yield was dependent on background rather than on presence of the 
transgene. 

15 2.2. Selection of an elite event 

This selection procedure, yielded one elite event which displayed optimal expression of 
the 35S-bar gene, i.e. tolerance to glufosinate ammonium, without penalty on agronomic 
performance and yield. This elite event was named EE-GH1. 

20 

2.3. Testing of EE-GH1 in cotton varieties with different genetic backgrounds and in 
different locations 

The selected event was introduced into different commercial genetic backgrounds, 
25 including FM989, FM 832, FM958, and FM966 and results of field trials of four different 
locations were compared. Plants were sprayed with 1600 g.a.i./ha, using different 
treatments (1x3-5 leaf stage, 4x, 3-5 leaf stage, lx+lx, 3-5 leaf stage, 4x+4x, 3-5 leaf 
stage, 0 as control). 

Seedling emergence and vigor rating for the elite event was very good. 
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No visible damage as a result of herbicide application was ever observed after application 
regardless of rate or stage of development at the time of application. There were no 
detrimental effects on morphology or growth habit of plants by herbicide application 

5 Furthermore, the event had normal leaf, flower and boll morphology, excellent fertility, 
and showed no disease or abnormal insect susceptibility in multiple genetic backgrounds. 
During introgression into multiple genetic backgrounds no aberrant problems or 
abnormalities were observed over four generations. 

10 2.4. Genetic analysis of the locus 

The genetic stability of the insert for the EE-GH1 event was checked by molecular and 
phenotypic analysis in the progeny plants over several generations. 

Southern blot analyses of plants of the Tl ? T2 and T3 generation were compared for the 
15 EE-GH1 event. The patterns obtained were found to be identical in the different 
generations. This proves that the molecular configuration of the foreign DNA in EE-GH1 
was stable. 

The EE-GH1 event displayed Mendelian segregation for the transgene as a single genetic 
20 locus in at least three subsequent generations indicating that the insert is stable. 

Example 3 : Characterization of elite event EE-GH1 

25 3.1 In-depth molecular and genetic analysis of the locus 

Once the EE-GH1 event was identified as the event in which expression of the transgene 
as well as overall agronomic performance were optimal, the locus of the transgene was 
analyzed in detail on a molecular level. This included sequencing of the flanking regions 
30 of the transgene. 
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The sequence of the regions flanking the inserted transgene in the EE-GH1 event was 
determined using the TAIL-PCR protocol as described by Liu et al. (1995, Plant J. 8(3): 
457-463). 

5 a) Determination of the 5' flanking region 



The primers used were: 







Sequence (5' -> 3') 


Position in 
pGSV71 


Degenerate 
primer 


MDB327 


NTg. Agg. WTC .NWg.TS A.T 
(SEQ ID NO: 8) 




Primary TAIL 


MLD015 


Tgg.TTC.CTA.gCg.TgA.gCC.AgT.g 
(SEQ ID NO: 9) 


606^585 


Second. TAIL 


MLD016 


AgC.TgC.TgC.TCT.TgC.CTC.TgT 
(SEQ ID NO: 10) 


467^447 


Tertiary TAIL 


GHI05 


ggA.CCg.TTA.TAC.ACA.ACg.Tag 
(SEQ ID NO: 3) 


358->338 



Whereby N=A,C f T or g; S=C or g; W=A or T 



10 

The fragment amplified using MDB327-GHI05 was ca. 1200 bp which was sequenced 
(5' flank: SEQ ID NO: 1). The sequence between bp 1 and bp 677 comprised plant DNA, 
while the sequence between bp 678 and bp 850 corresponded to pGSV71 DNA. 

15 b) Determination of the 3' flanking region 
The primers used were: 
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Sequence (5' -> 3') 


Position in 
pGSV71 


Degenerate 
primer 


MDB612 


NgT.gCT.SWg.ANA.WgA.T 
(SEQIDNO: 11) 




Primary 
TAIL 


MDB053 


CAT.gAC.gTg.ggT.TCC.Tgg.Cag.C 
(SEQ ID NO: 12) 


2109-2130 


Secondary 
TAIL 


MDB356 


AAT.CCT.gTT.gCC.ggT.CTT.gCg 
(SEQIDNO: 13) 


2252-2272 


Tertiary TAIL 


DPA017 


gAT.TAgAgT.CCC.gCAATTATA.C 
(SEQ ID NO: 14) 


2362-2383 



Whereby: N = A,C,T or g; S = C or g; W = A or T 



The fragment amplified using MDB612-DPA017 was ca. 400 bp, the complete sequence 
5 of which was determined (SEQ ID NO: 4). The sequence between nucleotide 1 and 179 
corresponds to T-DNA, while the sequence between nucleotide 180 and 426 corresponds 
to plant DNA. 

c) Identification of the target site deletion 

10 

Using primers corresponding to sequences within the flanking regions of the transgene on 
the wildtype Gossypium hirsutum as a template, the insertion site of the transgene was 
identified. 

15 The following primers were used: 

Position in Position in 

Sequence (5' -> 3') 5'flank 3'flank 

(SEQ ID NO: 1 ) (SEQ ID NO: 4) 

GHI06 TTg.CAC.CAT.CTA.gCT.CAC.TC 815 -» 795 

(SEQ ID NO: 2) 
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MLD019 CAA.gAT.gCg.AgC.AAC.TAT.gT (SEC 285 ^ 266 

ID NO: 15) 

This yielded a 200 bp fragment (SEQ ID NO: 16) in which bp 85 to 122 corresponds to a 
target site deletion. 

5 Thus, the insertion region (SEQ ID NO: 5) as sequenced comprises: 

1-677: 5 5 flanking region bp 1 to 677 of SEQ ID NO: 1 

678-714: target site deletion bp 85 to 122 of SEQ ID NO: 16 

715-916: 3' flanking region bp 180 to 426 of SEQ ID NO: 4 

10 3.2. Genetic analysis of the locus 

The genetic stability of the insert was checked by molecular and phenotypic analysis in 
the progeny plants over several generations. 

Southern blot analyses on glufosinate tolerant plants of EE-GH1 cotton plants of the T 0j 
15 Tj and T 2 generation were compared and were found to be identical. This proves that the 
molecular configuration of the transgene in EE-GH1 containing plants was stable. 

The EE-GH1 event displayed Mendelian segregation for the transgene as a single genetic 
locus in at least three subsequent generations indicating that the insert is stable. 

20 

On the basis of the above results EE-GH1 was identified as an elite event. 



Example 4: Development of diagnostic tools for identity control 

25 

A EE-GH1 Elite event PCR Identification protocol was developed to identify the 
presence of EE-GH1 in plants, plant material or biological samples. 
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EE-GH1 Elite event Polymerase Chain Reaction identification protocol 



A test run, with all appropriate controls, has to be performed before attempting to screen 
unknowns. The presented protocol might require optimization for components that may 
differ between labs (template DNA preparation, Taq DNA polymerase, quality of the 
primers, dNTP's, thermocyler, etc.). 

Amplification of the endogenous sequence plays a key role in the protocol. One has to 
attain PCR and thermocycling conditions that amplify equimolar quantities of both the 
endogenous and transgenic sequence in a known transgenic genomic DNA template. 
Whenever the targeted endogenous fragment is not amplified or whenever the targeted 
sequences are not amplified with the same ethidium bromide staining intensities, as 
judged by agarose gel electrophoresis, optimization of the PCR conditions may be 
required. 

Template DNA 

Template DNA is prepared according to the CTAB method described by Doyle and 
Doyle (1987, Phytochem. Bull. 19: 1 1). When using DNA prepared with other methods, a 
test run utilizing different amounts of template should be done. Usually 50 ng of genomic 
template DNA yields the best results. 

Assigned positive and negative controls 

The following positive and negative controls should be included in a PCR run: 

- Master Mix control (DNA negative control). This is a PCR in which no DNA is added 
to the reaction. When the expected result, no PCR products, is observed this indicates 
that the PCR cocktail was not contaminated with target DNA. 
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- A DNA positive control (genomic DNA sample known to contain the transgenic 
sequences). Successful amplification of this positive control demonstrates that the 
PCR was run under conditions which allow for the amplification of target sequences. 



5 - A wildtype DNA control. This is a PCR in which the template DNA provided is 
genomic DNA prepared from a non-transgenic plant. When the expected result, no 
amplification of the transgene PCR product but amplification of the endogenous PCR 
product, is observed this indicates that there is no detectable transgene background 
amplification in a genomic DNA sample. 

10 

Primers 



The following primers, which specifically recognize the transgene and a flanking 
sequence of EE-GH1 are used: 

15 



Primer 


Sequence (5' -> 3') 


Position in 
SEQ ID NO: 1 


Target 


GHI05 


ggA.CCg.TTA.TAC.ACA.ACg.Tag 
(SEQ ID NO: 3) 


758-* 738 


pGSV71 
sequence 


GHI06 


TTg.CAC.CAT.CTA.gCT.CAC.TC 
(SEQ ID NO: 2) 


815^ 795 


Plant DNA 
Sequence 



Primers targeting an endogenous sequence are always included in the PCR cocktail. 
These primers serve as an internal control in unknown samples and in the DNA positive 
control. A positive result with the endogenous primer-pair demonstrates that there is 
20 ample DNA of adequate quality in the genomic DNA preparation for a PCR product to be 
generated. The endogenous primers used are: 



GHI01: 5'-AAC.CTA.ggC.TgC.TgA.Agg.AgC-3' (SEQ ID NO: 17) 

(Alcohol dehydrogenase gene Acc. NO: AF036569, 1070->1090) 

25 
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GHI02: 5'-CAA.CTC.CTC.CAg.TCA.TCT.CCg-3' (SEQ ID NO: 18) 

(Alcohol dehydrogenase gene Acc. NO: AF036569, 1515->1495) 

Amplified fragments 

The expected amplified fragments in the PCR reaction are: 

For primer pair GHIO 1 -GHI02 : 445bp (endogenous control) 

For primer pair GHI05-GHI06: 269bp (EE-GH 1 Elite Event) 

PCR conditions 

The PCR mix for 50jal reactions contains: 
5 \il template DNA 

5 |il lOx Amplification Buffer (supplied with Taq polymerase) 

1 |al lOmMdNTP's 

0.5 |il GHI01 (10pmoles/|al) 

0.5 |nl GHI02 (10pmoles/|il) 

1 |il GHI05(10pmoles/|al) 

1 \xl GHI06 (10pmoles/|Lil) 

0.2 jal Taq DNA polymerase (5 units/^1) 

water up to 50 |il 

The thermocycling profile to be followed for optimal results is the following: 

4 min. at 95°C 
Followed by: 1 min. at 95°C 

1 min. at 57°C 

2 min. at 72°C 
For 5 cycles 

Followed by: 30 sec. at 92°C 
30 sec. at 57°C 
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1 min. at 72°C 
For 25 cycles 
Followed by: 5 minutes at 72°C 



5 Agarose gel analysis . 

Between 10 and 20|jl of the PCR samples should be applied on a 1.5% agarose gel (Tris- 
borate buffer) with an appropriate molecular weight marker (e.g. lOObp ladder 
PHARMACIA). 

10 

Validation of the results 



Data from transgenic plant DNA samples within a single PCR run and a single PCR 
cocktail should not be acceptable unless 1) the DNA positive control shows the expected 
15 PCR products (transgenic and endogenous fragments), 2) the DNA negative control is 
negative for PCR amplification (no fragments) and 3) the wild-type DNA control shows 
the expected result (endogenous fragment amplification). 

Lanes showing visible amounts of the transgenic and endogenous PCR products of the 
20 expected sizes, indicate that the corresponding plant from which the genomic template 
DNA was prepared, has inherited the EE-GH1 elite event. Lanes not showing visible 
amounts of the transgenic PCR product and showing visible amounts of the endogenous 
PCR product, indicate that the corresponding plant from which the genomic template 
DNA was prepared, does not comprise the elite event. Lanes not showing visible amounts 
25 of the endogenous and transgenic PCR products, indicate that the quality and/or quantity 
of the genomic DNA didn't allow for a PCR product to be generated. These plants cannot 
be scored. The genomic DNA preparation should be repeated and a new PCR run, with 
the appropriate controls, has to be performed. 
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Use of discriminating PCR protocol to identify EE-GH1 

Cotton leaf material from plants comprising different transgenic events (samples 1 to 4) 
was tested according to the above-described protocol. Samples from cotton wild-type 
5 were taken as negativexontrols. 

The results of the PCR analysis are illustrated in Figure 1 . Sample 1 is recognized as 
comprising elite event EE-GH1. All other tested lines do not comprise this elite event. 

10 

Example 5: Introgression of EE-GH1 into preferred cultivars 

Elite event EE-GH1 is introduced by repeated back-crossing into the following 
commercial cotton cultivars: FM5013, FM5015, FM5017, FM989, FM832, FM966 and 
15 FM958. 

It is observed that the introgression of the elite event into these cultivars does not 
significantly influence any of the desirable phenotypic or agronomic characteristics of 
these cultivars (no linkage drag) while expression of the transgene, as determined by 
20 glufosinate tolerance, meets commercially acceptable levels. This confirms the status of 
event EE-GH1 as an elite event. 

As used in the claims below, unless otherwise clearly indicated, the term "plant" is 
intended to encompass plant tissues, at any stage of maturity, as well as any cells, tissues, 
25 or organs taken from or derived from any such plant, including without limitation, any 
seeds, leaves, stems, flowers, roots, single cells, gametes, cell cultures, tissue cultures or 
protoplasts. 

Reference seed comprising elite event EE-GH1 was deposited as EE-GH1 at the ATCC 
30 (10801 University Blvd., Manassas, VA 201 10-2209) on April 26, 2001, under ATCC 
accession number PTA-3343. 
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As used in the claims below, unless otherwise clearly indicated, the term "plant" is 
intended to encompass plant tissues, at any stage of maturity, as well as any cells, tissues, 
or organs taken from or derived from any such plant, including without limitation, any 
seeds, leaves, stems, flowers, roots, single cells, gametes, cell cultures, tissue cultures or 
protoplasts. 

The above description of the invention is intended to be illustrative and not limiting. 
Various changes or modifications in the embodiments described may occur to those skilled 
in the art. These can be made without departing from the spirit or scope of the invention. 
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WHAT IS CLAIMED IS: 

1 . A transgenic cotton plant or seed, cells or tissues thereof comprising event EE-GH1 
in its genome. 

2. The transgenic cotton plant, or seed, cells or tissues of claim 1, the genomic DNA 
of which, when analyzed using the Elite event identification protocol for EE-GH1 
with two primers comprising the nucleotide sequence of SEQ ID NO: 2 and SEQ 
ID NO: 3 respectively, yields a DNA fragment of between 250 and 290 bp. 

3. The cotton plant, or seed, cells or tissues thereof, according to claim 2, wherein said 
DNA fragment is a fragment of about 269 bp. 

4. A cotton plant, or seed, cells or tissues thereof, obtained by propagation of and/or 
breeding with a cotton plant grown from the seed deposited at the ATCC under 
accession number PTA-3343. 

5. A cotton plant, seed, cells or tissues thereof which is the progeny of the seed 
deposited at the ATCC under accession number PTA-3343. 

6. A method for identifying elite event EE-GH1 in biological samples, which method 
comprises detecting an EE-GH1 specific region with a primer or probe which 
specifically recognizes the 5' flanking region of SEQ ID NO: 3 or the 3' flanking 
region of SEQ ID NO: 4 of EE-GH1. 

7. The method of claim 6, said method comprising amplifying a DNA fragment of 
between 100 and 350 bp from a nucleic acid present in said biological samples 
using a polymerase chain reaction with at least two primers, one of which 
recognizes the 5' or 3' flanking region of EE-GH1 and the other which recognizes a 
sequence within the foreign DNA of EE-GH1. 
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8. The method of claim 7, wherein said one primer recognizes a sequence within the 
5' flanking region of SEQ ID NO: 3 and said other primer recognizes a sequence 
within the foreign DNA of EE-GH1. 

9. The method of claim 8, wherein said primer recognizing a sequence within the 5' 
flanking region of EE-GH1 comprises the sequence of SEQ ID NO: 2. 

10. The method of any one of claims 6 to 9, wherein said primer recognizing a 
sequence within the foreign DNA comprises the sequence of SEQ ID NO: 1. 

11. A method for identifying EE-GH1 in a biological sample, which method comprises 
detecting an EE-GH1 specific region with a specific primer or probe which 
hybridizes under stringent conditions to a sequence within the 5' of SEQ ID NO: 3 
or within the 3' flanking sequence of SEQ ID NO: 4 of EE-GH1. 

12. A method for identifying a transgenic plant, or cells or tissues thereof, comprising 
the elite event EE-GH1, which method comprises establishing that genomic DNA 
can be used, according to a PCR identification protocol, to amplify a DNA 
fragment of between 250 and 290 bp, using a polymerase chain reaction with two 
primers having the nucleotide sequence of SEQ ID NO: 1 and SEQ ID NO: 2, 
respectively. 

13. A kit for identifying elite event EE-GH1 in biological samples, said kit comprising 
at least one PCR primer or probe, which recognizes a sequence within the 5' 
flanking region of SEQ ID NO: 3 or the 3' flanking region of SEQ ID NO: 4 of EE- 
GH1. 

14. The kit of claim 13, wherein said at least one PCR primer recognizes a sequence 
within the plant DNA in SEQ ID NO: 3. 
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15. The kit of claim 14, wherein said primer recognizing a sequence within the plant 
DNA in SEQ ID NO: 3 comprises the sequence of SEQ ID NO: 2. 

16. The kit of Claims 13 to 15, which further comprises at least a second PCR primer 
or probe which recognizes a sequence within the foreign DNA of EE-GH1. 

17. The kit of claim 16, wherein said primer recognizing a sequence within the foreign 
DNA of EE-GH1 comprises the sequence of SEQ ID NO: 1. 

18. A method for confirming seed purity, which method comprises detecting an EE- 
GH1 specific DNA sequence with a specific primer or probe which specifically 
recognizes a sequence within the 5' flanking region of SEQ ID NO: 3 or the 3' 
flanking region of SEQ ID NO: 4 of EE-GH1, in seed samples. 

19. A method for screening seeds for the presence of EE-GH1, which method 
comprises detecting an EE-GH1 specific DNA sequence with a specific primer or 
probe which specifically recognizes a sequence within the 5' flanking region of 
SEQ ID NO: 3 or the 3' flanking region of SEQ ID NO: 4 of EE-GH1, in samples 
of seed lots. 

20. A seed deposited at the ATCC under accession number PTA-3343. 

21. A cotton seed comprising elite event EE-GH1, reference seed comprising said event 
having been deposited at the ATCC under accession number PTA-3343. 

22. A cotton plant, cell or tissue or plant material thereof comprising elite event EE- 
GH1, derived from the seed of claim 21. 

23. Transgenic cotton plants, seeds, cells or tissues, the genomic DNA of which 
comprises a transgene integrated into the chromosomal DNA in a region which 
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10 



comprises a sequence of at least 40 bp which hybridizes under stringent conditions 
with a sequence which is complementary to the sequence of SEQ ID NO: 5. 

24. A process for producing a transgenic cotton plant or cell or tissue of a cotton plant, 
said process comprising introducing a recombinant DNA molecule into a region of 
cotton chromosomal DNA corresponding to a sequence of at least 40 bp that 
hybridizes under stringent conditions with a sequence that is complementary to the 
sequence of SEQ ID NO: 5, and, optionally, regenerating a cotton plant from the 
transformed cotton cell or tissue. 

25. The process of claim 24, wherein said recombinant DNA molecule comprises an 
herbicide resistance gene. 



26. The plant or cell or tissue of a cotton plant obtained by the process of claims 24 or 
15 25. 

27. A transgenic cotton plant or seed, cells or tissues thereof comprising 

(i) event EE-GH1 in its genome ;or 

(ii) event EE-GH1 with the proviso that the bar gene used in the event is 
20 substituted with a nucleic acid sequence that hybridizes to the complement of the 

bar gene under stringent conditions. 

28. A DNA molecule which can be amplified from a nucleic acid sample of cotton, 
using a set of primers the first of which comprises a sequence of 15 to 30 

25 nucleotides complementary to a sequence within SEQ ID NO:l or SEQ ID NO:4, 

the second of which comprises a sequence complementary to the foreign DNA 
present in said nucleic acid sample of corn. 

29. The DNA molecule of claim 28, which can be amplified from a nucleic acid sample 
30 of cotton, using a set of primers, the first of which comprising SEQ ID NO:2, the 

second of which comprising SEQ ID NO:3. 
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30. A detection kit comprising the DNA of claim 28 or 29. 

31. A DNA molecule isolated from corn tissue, comprising the sequence of SEQ ID 
NO:l or SEQ ID NO:4 

32. A DNA molecule comprising a sequence which is identical to or complementary to 
the sequence of 9 nucleotides of plant DNA and foreign DNA at each side 
contiguous with the insertion site contained in SEQ ID NO: 1 or SEQ ID NO: 4. 
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1 2 3 4 5 6 7 




445bp (endo control) 
269bp (EE-GH1) 



Figure 1 
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SEQUENCE LISTING 
<110> Aventis CropScience N.V. 

<12 0> Herbicide tolerant cotton plants and methods for producing and 
identifying same 

<130> EE-GH1 

<150> US 09/921,922 
<151> 2001-08-06 

<160> 18 

<170> Patentln version 3.1 

<210> 1 

<211> 850 

<212> DNA 

<213> Artificial: sequence comprising 5' flanking region 

<400> 1 



ciciciyyy y a. L_y 


aaattaaata 


t tacct tatc 


aacaaaagga 


gttgtagctc 


atggaacaac 


60 


aatagtcttt 


tccacggaaa 


cctagatgat 


gtttctccaa 


tgcttgataa 


atctttaaca 


120 


ttgtcatcat 


aagttgcaac 


ctcatgtttc 


acacaagcat 


caatcaaatg 


ttgatcttca 


180 


ttactaaaat 


gtgcttgatc 


cttccttaca 


caaatctacc 


tatgttgtgg 


tattttgttc 


240 


tattcatcat 


tctaacaagt 


tttgcaattg 


agttgaactt 


cttccaatct 


cgtatcagcc 


300 


tataatagtg 


gggtctaata 


tgtccatttt 


tcccacaata 


atgacatata 


atctttctaa 


360 


agcttttatt 


ctctgcctta 


tgatgaaaag 


aacccaaatc 


tttaacttta 


acaaaaataa 


420 


gatgagcgat 


aggttcttca 


cctttattga 


tgtaaccaag 


tcctctatgg 


catggttcaa 


480 


ttctcattga 


agccaaaatt 


tcatgaaact 


tctcacattg 


gcctctaaac 


ttcttcaaga 


540 


tagcctttgc 


accatctagc 


tcactcttgg 


ttgttttcaa 


aacatcatcc 


gtttcttgga 


600 


ccacaatttt 


gagcttttca 


ttttctattt 


tgaggataat 


agtttattcc 


ctcaaggaac 


660 


tattcaactg 


agcttaacag 


tactcggccg 


tcgaccgcgg 


tacccggaat 


tccaatccca 


720 


caaaaatctg 


agcttaacag 


cacagttgct 


cctctcagag 


cagaatcggg 


tattcaacac 


780 


cctcatatca 


actactacgt 


tgtgtataac 


ggtccacatg 


ccggtatata 


cgatgactgg 


840 


ggttgtacaa 












850 



<210> 2 
<211> 20 
<212> DNA 

<213> Artificial: primer GHI06 
<400> 2 

iitxjcaccatc tagctcactc 
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<210> 3 
<211> 21 
<212> DNA 

<213> Artificial: primer GHI05 
<400> 3 

ggaccgttat acacaacgta g 



<210> 4 

<211> 426 

<212> DNA 

<213> Artificial: sequence comprising 3 ' flanking region 

<400> 4 



gattagagtc 


ccgcaattat 


acatttaata 


cgcgatagaa 


aacaaaatat 


agcgcgcaaa 


60 


ctaggataaa 


ttatcgcgcg 


cggtgtcatc 


tatgttacta 


gatcgggaag 


atcctctaga 


120 


gtcgacctgc 


aggcatgcaa 


gcttagatcc 


atggagccat 


ttacaattga 


atatatcctc 


180 


caaatattta 


aaaagaatat 


caccattatc 


cgaatcttct 


ttaaaatctg 


ttagaacacg 


240 


gtttggaata 


gtggtagtaa 


aagtaacata 


gttgctcgca 


tcttgatcta 


cattaaactt 


300 


tcttcatcac 


tccaagtgat 


tgtaaatgac 


ttctatttct 


tcttagtatt 


agcacattct 


360 


aattttaagt 


gaaacaatcc 


cttacattca 


taacattgaa 


tatccttcta 


tcatctcaca 


420 


gcacga 
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<210> 5 

<211> 961 

<212> DNA 

<213> Gossypium hirsutum: insertion region 

<400> 5 



aaaggggatg 


agattgaatg 


ttaccttatc 


aacaaaagga 


gttgtagctc 


atggaacaac 


60 


aatagtcttt 


tccacggaaa 


cctagatgat 


gtttctccaa 


tgcttgataa 


atctttaaca 


120 


ttgtcatcat 


aagttgcaac 


ctcatgtttc 


acacaagcat 


caatcaaatg 


ttgatcttca 


180 


ttactaaaat 


gtgcttgatc 


cttccttaca 


caaatctacc 


tatgttgtgg 


tattttgttc 


240 


tattcatcat 


tctaacaagt 


tttgcaattg 


agttgaactt 


cttccaatct 


cgtatcagcc 


300 


tataatagtg 


gggtctaata 


tgtccatttt 


tcccacaata 


atgacatata 


atctttctaa 


360 


agcttttatt 


ctctgcctta 


tgatgaaaag 


aacccaaatc 


tttaacttta 


acaaaaataa 


420 


gatgagcgat 


aggttcttca 


cctttattga 


tgtaaccaag 


tcctctatgg 


catggttcaa 


480 


ttctcattga 


agccaaaatt 


tcatgaaact 


tctcacattg 


gcctctaaac 


ttcttcaaga 


540 


tagcctttgc 


accatctagc 


tcactcttgg 


ttgttttcaa 


aacatcatcc 


gtttcttgga 


600 


ccacaatttt 


gagcttttca 


ttttctattt 


tgaggataat 


agtttattcc 


ctcaaggaac 


660 
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tattcaactg 


agcttaaatc 


tcaatttttt 


ttaacatatg 


actataagta 


tcctccaaat 


720 


atttaaaaag 


aatatcacca 


ttatccgaat 


cttctttaaa 


atctgttaga 


acacggtttg 


780 


gaatagtggt 


agtaaaagta 


acatagttgc 


tcgcatcttg 


atctacatta 


aactttcttc 


84 0 


atcactccaa 


gtgattgtaa 


atgacttcta 


tttcttctta 


gtattagcac 


attctaattt 


900 


taagtgaaac 


aatcccttac 


attcataaca 


ttgaatatcc 


ttctatcatc 


tcacagcacg 


960 


a 












961 



<210> 6 

<211> 9555 

<212> DNA 

<213> Artificial: plasmid pGSV71 

<400> 6 



agattcgaag 


ctcggtcccg 


tgggtgttct 


gtcgtctcgt 


tgtacaacga 


aatccattcc 


60 


cattccgcgc 


tcaagatggc 


ttcccctcgg 


cagttcatca 


gggctaaatc 


aatctagccg 


120 


acttgtccgg 


tgaaatgggc 


tgcactccaa 


cagaaacaat 


caaacaaaca 


tacacagcga 


180 


cttattcaca 


cgcgacaaat 


tacaacggta 


tatatcctgc 


cagtactcgg 


ccgtcgaccg 


240 


cggtacccgg 


aattccaatc 


ccacaaaaat 


ctgagcttaa 


cagcacagtt 


gctcctctca 


300 


gagcagaatc 


gggtattcaa 


caccctcata 


tcaactacta 


cgttgtgtat 


aacggtccac 


360 


atgccggtat 


atacgatgac 


tggggttgta 


caaaggcggc 


aacaaacggc 


gttcccggag 


420 


ttgcacacaa 


gaaatttgcc 


actattacag 


aggcaagagc 


agcagctgac 


gcgtacacaa 


480 


caagtcagca 


aacagacagg 


ttgaacttca 


tccccaaagg 


agaagctcaa 


ctcaagccca 


540 


agagctttgc 


taaggcccta 


acaagcccac 


caaagcaaaa 


agcccactgg 


ctcacgctag 


600 


gaaccaaaag 


gcccagcagt 


gatccagccc 


caaaagagat 


ctcctttgcc 


ccggagatta 


660 


caatggacga 


tttcctctat 


ctttacgatc 


taggaaggaa 


gttcgaaggt 


gaaggtgacg 


720 


acactatgtt 


caccactgat 


aatgagaagg 


ttagcctctt 


caatttcaga 


aagaatgctg 


780 


acccacagat 


ggttagagag 


gcctacgcag 


caggtctcat 


caagacgatc 


tacccgagta 


840 


acaatctcca 


ggagatcaaa 


taccttccca 


agaaggttaa 


agatgcagtc 


aaaagattca 


900 


ggactaattg 


catcaagaac 


acagagaaag 


acatatttct 


caagatcaga 


agtactattc 


960 


cagtatggac 


gattcaaggc 


ttgcttcata 


aaccaaggca 


agtaatagag 


attggagtct 


1020 


ctaaaaaggt 


agttcctact 


gaatctaagg 


ccatgcatgg 


agtctaagat 


tcaaatcgag 


1080 


gatctaacag 


aactcgccgt 


gaagactggc 


gaacagttca 


tacagagtct 


tttacgactc 


1140 


aatgacaaga 


agaaaatctt 


cgtcaacatg 


gtggagcacg 


acactctggt 


ctactccaaa 


1200 


aafcgtcaaag atacagtctc 


agaagaccaa 


agggctattg 


agacttttca 


acaaaggata 


1260 
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atttcgggaa 


acctcctcgg 


attccattgc 


ccagctatct 


gtcacttcat 


cgaaaggaca 


1320 


gtagaaaagg 


aaggtggctc 


ctacaaatgc 


catcattgcg 


ataaaggaaa 


ggctatcatt 


1380 


caagatgcct 


ctgccgacag 


tggtcccaaa 


gatggacccc 


cacccacgag 


gagcatcgtg 


1440 


gaaaaagaag 


acgttccaac 


cacgtcttca 


aagcaagtgg 


attgatgtga 


catctccact 


1500 


gacgtaaggg 


atgacgcaca 


atcccactat 


ccttcgcaag 


acccttcctc 


tatataagga 


1560 


agttcatttc 


atttggagag 


gacacgctga 


aatcaccagt 


ctctctctat 


aaatctatct 


1620 


ctctctctat 


aaccatggac 


ccagaacgac 


gcccggccga 


catccgccgt 


gccaccgagg 


1680 


cggacatgcc 


ggcggtctgc 


accatcgtca 


accactacat 


cgagacaagc 


acggtcaact 


1740 


tccgtaccga 


gccgcaggaa 


ccgcaggagt 


ggacggacga 


cctcgtccgt 


ctgcgggagc 


1800 


gctatccctg 


gctcgtcgcc 


gaggtggacg 


gcgaggtcgc 


cggcatcgcc 


tacgcgggcc 


1860 


cctggaaggc 


acgcaacgcc 


tacgactgga 


cggccgagtc 


gaccgtgtac 


gtctcccccc 


1920 


gccaccagcg 


gacgggactg 


ggctccacgc 


tctacaccca 


cctgctgaag 


tccctggagg 


1980 


cacagggctt 


caagagcgtg 


gtcgctgtca 


tcgggctgcc 


caacgacccg 


agcgtgcgca 


2040 


tgcacgaggc 


gctcggatat 


gccccccgcg 


gcatgctgcg 


ggcggccggc 


ttcaagcacg 


2100 


ggaactggca 


tgacgtgggt 


ttctggcagc 


tggacttcag 


cctgccggta 


ccgccccgtc 


2160 


cggtcctgcc 


cgtcaccgag 


atctgagatc 


acgcgttcta 


ggatccgaag 


cagatcgttc 


2220 


aaacatttgg 


caataaagtt 


tcttaagatt 


gaatcctgtt 


gccggtcttg 


cgatgattat 


2280 


catataattt 


ctgttgaatt 


acgttaagca 


tgtaataatt 


aacatgtaat 


gcatgacgtt 


2340 


atttatgaga 


tgggttttta 


tgattagagt 


cccgcaatta 


tacatttaat 


acgcgataga 


2400 


aaacaaaata 


tagcgcgcaa 


actaggataa 


attatcgcgc 


gcggtgtcat 


ctatgttact 


2460 


agatcgggaa 


gatcctctag 


agtcgacctg 


caggcatgca 


agcttagatc 


catggagcca 


2520 


tttacaattg 


aatatatcct 


gccgccgctg 


ccgctttgca 


cccggtggag 


cttgcatgtt 


2580 


ggtttctacg 


cagaactgag 


ccggttaggc 


agataatttc 


cattgagaac 


tgagccatgt 


2640 


gcaccttccc 


cccaacacgg 


tgagcgacgg 


ggcaacggag 


tgatccacat 


gggactttta 


2700 


aacatcatcc 


gtcggatggc 


gttgcgagag 


aagcagtcga 


tccgtgagat 


cagccgacgc 


2760 


accgggcagg 


cgcgcaacac 


gatcgcaaag 


tatttgaacg 


caggtacaat 


cgagccgacg 


2820 


ttcacggtac 


cggaacgacc 


aagcaagcta 


gcttagtaaa 


gccctcgcta 


gattttaatg 


2880 


cggatgttgc 


gattacttcg 


ccaactattg 


cgataacaag 


aaaaagccag 


cctttcatga 


2940 


tatatctccc 


aatttgtgta 


gggcttatta 


tgcacgctta 


aaaataataa 


aagcagactt 


3000 


ga"cctgatag 


tttggctgtg 


agcaattatg 


tgcttagtgc 


atctaacgct 


tgagttaagc 


3060 


cgcgccgcga 


agcggcgtcg 


gcttgaacga 


attgttagac 


attatttgcc 


gactaccttg 


3120 
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gtgatctcgc 


ctttcacgta 


gtggacaaat 


tcttccaact 


gatctgcgcg 


cgaggccaag 


3180 


cgatcttctt 


cttgtccaag 


ataagcctgt 


ctagcttcaa 


gtatgacggg 


ctgatactgg 


3240 


gccggcaggc 


gctccattgc 


ccagtcggca 


gcgacatcct 


tcggcgcgat 


tttgccggtt 


3300 


actgcgctgt 


accaaatgcg 


ggacaacgta 


agcactacat 


ttcgctcatc 


gccagcccag 


3360 


tcgggcggcg 


agttccatag cgttaaggtt 


tcatttagcg 


cctcaaatag 


atcctgttca 


3420 


ggaaccggat 


caaagagttc 


ctccgccgct 


ggacctacca 


aggcaacgct 


atgttctctt 


3480 


gcttttgtca 


gcaagatagc 


cagatcaatg 


tcgatcgtgg 


ctggctcgaa 


gatacctgca 


3540 


agaatgtcat 


tgcgctgcca 


ttctccaaat 


tgcagttcgc 


gcttagctgg 


ataacgccac 


3600 


ggaatgatgt 


cgtcgtgcac 


aacaatggtg 


acttctacag 


cgcggagaat 


ctcgctctct 


3660 


ccaggggaag 


ccgaagtttc 


caaaaggtcg 


ttgatcaaag 


ctcgccgcgt 


tgtttcatca 


3720 


agccttacgg 


tcaccgtaac 


cagcaaatca 


atatcactgt 


gtggcttcag 


gccgccatcc 


3780 


actgcggagc 


cgtacaaatg 


tacggccagc 


aacgtcggtt 


cgagatggcg 


ctcgatgacg 


3840 


ccaactacct 


ctgatagttg 


agtcgatact 


tcggcgatca 


ccgcttccct 


catgatgttt 


3900 


aactttgttt 


tagggcgact 


gccctgctgc 


gtaacatcgt 


tgctgctcca 


taacatcaaa 


3960 


catcgaccca 


cggcgtaacg 


cgcttgctgc 


ttggatgccc 


gaggcataga 


ctgtacccca 


4020 


aaaaaacagt 


cataacaagc 


catgaaaacc 


gccactgcgc 


cgttaccacc 


gctgcgttcg 


4080 


gtcaaggttc 


tggaccagtt 


gcgtgagcgc 


atacgctact 


tgcattacag 


cttacgaacc 


4140 


gaacaggctt 


atgtccactg 


ggttcgtgcc 


ttcatccgtt 


tccacggtgt 


gcgtcacccg 


4200 


gcaaccttgg 


gcagcagcga 


agtcgaggca 


tttctgtcct 


ggctggcgaa 


cgagcgcaag 


4260 


gtttcggtct 


ccacgcatcg 


tcaggcattg 


gcggccttgc 


tgttcttcta 


cggcaagtgc 


4320 


tgtgcacgga 


tctgccctgg 


cttcaggaga 


tcggaagacc 


tcggccgtcc 


gggcgcttgc 


4380 


cggtggtgct 


gaccccggat 


gaagtggttc 


gcatcctcgg 


ttttctggaa 


ggcgagcatc 


4440 


gtttgttcgc 


ccagcttctg 


tatggaacgg 


gcatgcggat 


cagtgagggt 


ttgcaactgc 


4500 


gggtcaagga 


tctggatttc 


gatcacggca 


cgatcatcgt 


gcgggagggc 


aagggctcca 


4560 


aggatcgggc 


cttgatgtta 


cccgagagct 


tggcacccag 


cctgcgcgag 


cagggatcga 


4620 


tccaacccct 


ccgctgctat 


agtgcagtcg 


gcttctgacg 


ttcagtgcag 


ccgtcttctg 


4680 


aaaacgacat 


gtcgcacaag 


tcctaagtta 


cgcgacaggc 


tgccgccctg 


cccttttcct 


4740 


ggcgttttct 


tgtcgcgtgt 


tttagtcgca 


taaagtagaa 


tacttgcgac 


tagaaccgga 


4800 


gacattacgc 


catgaacaag 


agcgccgccg 


ctggcctgct 


gggctatgcc 


cgcgtcagca 


4860 


ccgacgacca ggacttgacc aaccaacggg ccgaactgca 


cgcggccggc 


tgcaccaagc 


4920 
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tattttccaa 


gaagat cacc 


qqcaccaQQC 


gcgaccgccc 


ggagctggcc 


aggatgettg 


4980 


arracctaca 

c*. ca. v.- i— '"^3 


ccctggcgac 


QttQtQacacr 


tgaccaggct 


agaccgcctg 


gcccgcagca 


5040 


cccgcgacct 


actggacatt 


gccgagcgca 


tecaqqaqqe 


cggcgcgggc 


ctgcgtagcc 


5100 


tLdC3Ca.GB.OCC 

~n V* tAH w w 


QtQQQCCQaC 
y »-yyy^-' s -'y«-*- % - 


accaccacgc 


cqqccgqccq 


catggtgttg 


accgtgttcg 


5160 


rraor attar 


caaattcaaa 


cgtt ccctaa 


tcatcgaccg 


cacccggagc 


gggegegagg 


5220 


w> Vp* t* 33 33 ^ 




aagtttggcc 


cccgccctac 


cctcaccccg 


gcacagatcg 


5280 


*w a v— —4 


cgagc tgatc 


qaccaqqaaq 


gccgcaccgt 


gaaagaggcg 


gctgcactgc 


5340 


t~ 1" ci ci r* a tar* a 


tcactcaacc 


CtQtaCCQCCf 


cacttgagcg 


cagegaggaa 


gtgacgccca 


5400 


uuy ciyy ^v- y 




gcct tccgtg 


aggaegcatt 


gaccgaggcc 


gacgccctgg 


5460 




aaataaacac 


caagaggaac 


aagcatgaaa 


ccgcaccagg 


aeggecagga 


5520 




ttcattacca 

V— o w c*. ^ ^— c* • 


aagagat cga 


qqcqqaqatq 


atcqcqqccq 

w» zj ZJ ZJ —J 


ggtacgtgtt 


5580 


r* f*r ^ ri r* r* rr r* r* r* 

y y y 


apnra ratrt 
y ^y ^ *-* ^y t * 


caaccotaco 

\A *w *«j w 33 v 33 


getgeatgaa 


atcctggccg 


gtttgtctga 


5640 


I— y . d ciy l. y 


y^yy^^* u yy ^ 


cggccagc t t 


qqccqctqaa 


gaaaccgagc 


gccgccgtct 


5700 


a a a a a crcil - tTA 
ctaactdyy Ly a 


tatatattta 


agt aaaacag 


ettgegtcat 


qcqqtcqctq 

ZD ZJ ZJ ^ ZJ ZJ 


cgtatatgat 


5760 


y ^-y c*. «y i— c* 


aataaacaaa 


tacgcaaggg 


gaacgcatga 


aggttatege 


tgtacttaac 


5820 


^ayaaayyuy 


nnt raaacaa 
yy u *- a yy v ' cit * 


gacgaccat c 


gcaacccatc 


tagcccgcgc 


cctgcaactc 


5880 


y u ^-yyyy^^y 


atattctatt 

d 1m- 33 w w- w» w w- w- 


agt cga t tec 


gatccccagg 


gcagtgcccg 


cgattgggcg 


5940 


y LL y *-y <-yyy 


Cl^H d. Vta-» 


gc taaccgt t 


qtcqqcatcq 


accgcccgac 


gattgacege 


6000 


nafataaa cici 
y ci v_»y uy aay y 


ccatcaacca 

V-» C4. O V-« V** ^— * 3^ 


gegegae t tc 


qtaqtqatcq 

33 *■* 33 33 ^3 


acqqaqcgcc 

zj zj zj ~j 


ccaggcggcg 


6060 


aa c 1 1 acfc* t a 


tgtccgcgat 


caaggcagcc 


gacttcgtgc 


tgattccggt 


gcagccaagc 


6120 




tatgggccac 


cgccgacctg 


qtqqaqctqq 


ttaagcagcg 


cattgaggtc 


6180 


acaaataaaa 

w 33 33 


ggctacaagc 


qqcctttqtc 

33 33 www ^3 w 


qtqtcqcgqg 

3 ZJ ZJ ZJ ZJ ZJ 


cgatcaaagg 


cacgcgcatc 


6240 


aacaataaacr 
yy^yy ^-y ^-y y 


ttqccQaqqc 


QctciQCcqqcr 


tacgagctgc 


ccattcttga 


gtcccgtatc 


6300 


acgcagcgcg 


tgagctaccc 


aggcactgcc 


gccgccggca 

ZJ ZJ — ' — ) 


caaccgttct 


tgaatcagaa 


6360 


cccaaaaaco 
v -y c *yyy*-'y 


acgctgcccg 


caaQCftccacr 

33 33 33 w 33 


qcqctqqccq 

ZD ZJ ZJ ZJ ZJ 


ctgaaattaa 


atcaaaactc 


6420 


at t tgagtta 


atgaggtaaa 


gagaaaatga 


gcaaaagcac 


aaacaegcta 


agtgccggcc 


6480 


at c caaciccic 


acgcagcagc 


aaggc tgcaa 


cqttqqccaq 

w^ 33 w w 33 33 w- w^ w*. 33 


cctggcagac 


acgccagcca 


6540 


tgaagcgggt 


caactttcag 


ttgeeggegg 


aggatcacac 


caagctgaag 


atgtacgcgg 


6600 


tacgccaagg 


caagaccatt 


accgagctgc 


tatctgaata 


catcgcgcag 


ctaccagagt 


6660 


aaatgagcaa 


atgaataaat 


gagtagatga 


attttagegg 


ctaaaggagg 


cggcatggaa 


6720 


aatcaagaac 


aaccaggcac 


cgacgccgtg 


gaatgcccca 


tgtgtggagg 


aacgggcggt 


6780 
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tggccaggcg taagcggctg ggttgtctgc cggccctgca atggcactgg aacccccaag 6840 

cccgaggaat cggcgtgacg gtcgcaaacc atccggcccg gtacaaatcg gcgcggcgct 6900 

gggtgatgac ctggtggaga agttgaaggc cgcgcaggcc gcccagcggc aacgcatcga 6960 

ggcagaagca cgccccggtg aatcgtggca agcggccgct gatcgaatcc gcaaagaatc 7020 

ccggcaaccg ccggcagccg cgtgcgccgtc gattaggaag ccgcccaagg gcgacgagca 7080 

accagatttt ttcgttccga tgctctatga cgtgggcacc cgcgatagtc gcagcatcat 7140 

ggacgtggcc gttttccgtc tgtcgaagcg tgaccgacga gctggcgagg tgatccgcta 7200 

cgagcttcca gacgggcacg tagaggtttc cgcagggccg gccggcatgg ccagtgtgtg 7260 

ggattacgac ctggtactga tggcggtttc ccatctaacc gaatccatga accgataccg 7320 

ggaagggaag ggagacaagc ccggccgcgt gttccgtcca cacgttgcgg acgtactcaa 73 80 

gttctgccgg cgagccgatg gcggaaagca gaaagacgac ctggtagaaa cctgcattcg 7440 

gttaaacacc acgcacgttg ccatgcagcg tacgaagaag gccaagaacg gccgcctggt 7500 

gacggtatcc gagggtgaag ccttgattag ccgctacaag atcgtaaaga gcgaaaccgg 7560 

gcggccggag tacatcgaga tcgagctagc tgattggatg taccgcgaga tcacagaagg 7620 

caagaacccg gacgtgctga cggttcaccc cgattacttt ttgatcgatc ccggcatcgg 7680 

ccgttttctc taccgcctgg cacgccgcgc cgcaggcaag gcagaagcca gatggttgtt 7740 

caagacgatc tacgaacgca gtggcagcgc cggagagttc aagaagttct gtttcaccgt 7800 

gcgcaagctg atcgggtcaa atgacctgcc ggagtacgat ttgaaggagg aggcggggca 7860 

ggctggcccg atcctagtca tgcgctaccg caacctgatc gagggcgaag catccgccgg 7920 

ttcctaatgt acggagcaga tgctagggca aattgcccta gcaggggaaa aaggtcgaaa 7 980 

aggtctcttt cctgtggata gcacgtacat tgggaaccca aagccgtaca ttgggaaccg 8040 

gaacccgtac attgggaacc caaagccgta cattgggaac cggtcacaca tgtaagtgac 8100 

tgatataaaa gagaaaaaag gcgatttttc cgcctaaaac tctttaaaac ttattaaaac 8160 

tcttaaaacc cgcctggcct gtgcataact gtctggccag cgcacagccg aagagctgca 8220 

aaaagcgcct acccttcggt cgctgcgctc cctacgcccc gccgcttcgc gtcggcctat 8280 

cgcggccgct ggccgctcaa aaatggctgg cctacggcca ggcaatctac cagggcgcgg 8340 

acaagccgcg ccgtcgccac tcgaccgccg gcgcccacat caaggcaccc tgcctcgcgc 8400 

gtttcggtga tgacggtgaa aacctctgac acatgcagct cccggagacg gtcacagctt 8460 

gtctgtaagc ggatgccggg agcagacaag cccgtcaggg cgcgtcagcg ggtgttggcg 8520 

ggtgtcgggg cgcagccatg acccagtcac gtagcgatag cggagtgtat actggcttaa 8580 
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ctatgcggca 


tcagagcaga 


ttgtactgag 


agtgcaccat 


atgcggtgtg 


aaataccgca 


q a a n 
ob4U 


cagatgcgta 


aggagaaaat 


accgcatcag 


gcgctcttcc 


gcttcctcgc 


tcactgactc 


O / U U 


gctgcgctcg 


gtcgttcggc 


tgcggcgagc 


ggtatcagct 


cactcaaagg 


cggtaatacg 


Q "7 a c\ 


gttatccaca 


gaatcagggg 


ataacgcagg 


aaagaacatg 


tgagcaaaag 


gccagcaaaa 


Q Q O C\ 


ggccaggaac 


cgtaaaaagg 


ccgcgttgct 


ggcgtttttc 


cataggctcc 


gcccccctga 


o o o U 


cgagcatcac 


aaaaatcgac 


gctcaagtca 


gaggtggcga 


aacccgacag 


gac t at aaag 


Q Q A n 


ataccaggcg 


tttccccctg 


gaagctccct 


cgtgcgctct 


cctgttccga 


ccctgccgc t 


q n r\ r\ 


taccggatac 


ctgtccgcct 


ttctcccttc 


gggaagcgtg 


gcgctttctc 


atagctcacg 


n n c n 


ctgtaggtat 


ctcagttcgg 


tgtaggtcgt 


tcgctccaag 


ctgggctgtg 


tgcacgaacc 




ccccgttcag 


cccgaccgct 


gcgccttatc 


cggtaactat 


cgtcttgagt 


ccaacccggt 


9180 


aagacacgac 


ttatcgccac 


tggcagcagc 


cactggtaac 


aggattagca 


gagcgaggta 


Q o a n 


tgtaggcggt 


gctacagagt 


tcttgaagtg 


gtggcctaac 


tacggctaca 


ctagaaggac 




agtatttggt 


atctgcgctc 


tgctgaagcc 


agttaccttc 


ggaaaaagag 


t tggt age t c 




ttgatccggc 


aaacaaacca 


ccgctggtag 


cggtggtttt 


tttgtttgca 


agcagcagat 


9420 


tacgcgcaga 


aaaaaaggat 


ctcaagaaga 


tccggaaaac 


gcaagcgcaa 


agagaaagca 


9480 


ggtagcttgc 


agtgggctta 


catggcgata 


gctagactgg 


gcggttttat 


ggacagcaag 


9540 


cgaaccggaa 


ttgcc 










9555 



<210> 7 
<211> 4182 
<212> DNA 

<213> Artificial: plasmid pRVA44 
<400> 7 

tcgcgcgttt eggtgatgae ggtgaaaacc tctgacacat gcagctcccg gagaeggtea 



60 



cagcttgtct gtaageggat geegggagea gacaagcccg teagggegeg teagegggtg 120 

ttggcgggtg teggggctgg cttaactatg eggcatcaga gcagattgta ctgagagtgc 180 

accatacctg caggcaattg gtacctacgt atgcatggcg cgccatatgc ccgggccctg 240 

tacagcggcc gcgttcctac gcagcaggtc tcatcaagac gatctacccg agtaacaatc 300 

tccaggagat caaatacctt cccaagaagg ttaaagatgc agtcaaaaga ttcaggacta 360 

attgeatcaa gaacacagag aaagacatat ttctcaagat cagaagtact attccagtat 420 

ggacgattca aggcttgett cataaaccaa ggcaagtaat agagattgga gtctctaaaa 



480 



aggtagttcc tactgaatct aaggccatgc atggagtcta agattcaaat cgaggatcta 540 



acagaactcg ccgtgaagac tggcgaacag ttcatacaga gtcttttacg actcaatgac 



600 
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aagaagaaaa 


tcttcgtcaa 


catggtggag 


cacgacactc 


tggtctactc 


caaaaatgtc 


660 


aaagatacag 


tctcagaaga 


ccaaagggct 


attgagactt 


ttcaacaaag 


gataatttcg 


720 


ggaaacctcc 


tcggattcca 


ttgcccagct 


atctgtcact 


tcatcgaaag 


gacagtagaa 


780 


aaggaaggtg 


gctcctacaa 


atgccatcat 


tgcgataaag 


gaaaggctat 


cattcaagat 


840 


gcctctgccg 


acagtggtcc 


caaagatgga 


cccccaccca 


cgaggagcat 


cgtggaaaaa 


900 


gaagacgttc 


caaccacgtc 


ttcaaagcaa 


gtggattgat 


gtgacatctc 


cactgacgta 


960 


agggatgacg 


cacaatccca 


ctatccttcg 


caagaccctt 


cctctatata 


aggaagttca 


1020 


tttcatttgg 


agaggacacg 


ctgaaatcac 


cagtctctct 


ctataaatct 


atctctctct 


1080 


ctataaccat 


ggacccagaa 


cgacgcccgg 


ccgacatccg 


ccgtgccacc 


gaggcggaca 


1140 


tgccggcggt 


ctgcaccatc 


gtcaaccact 


acatcgagac 


aagcacggtc 


aacttccgta 


1200 


ccgagccgca 


ggaaccgcag 


gagtggacgg 


acgacctcgt 


ccgtctgcgg 


gagcgctatc 


1260 


cctggctcgt 


cgccgaggtg 


gacggcgagg 


tcgccggcat 


cgcctacgcg 


ggcccctgga 


1320 


aggcacgcaa 


cgcctacgac 


tggacggccg 


agtcgaccgt 


gtacgtctcc 


ccccgccacc 


1380 


agcggacggg 


actgggctcc 


acgctctaca 


cccacctgct 


gaagtccctg 


gaggcacagg 


1440 


gcttcaagag 


cgtggtcgct 


gtcatcgggc 


tgcccaacga 


cccgagcgtg 


cgcatgcacg 


1500 


aggcgctcgg 


atatgccccc 


cgcggcatgc 


tgcgggcggc 


cggcttcaag 


cacgggaact 


1560 


ggcatgacgt 


gggtttctgg 


cagctggact 


tcagcctgcc 


ggtaccgccc 


cgtccggtcc 


1620 


tgcccgtcac 


cgagatctga 


tctcacgcgt 


ctaggatccg 


aagcagatcg 


ttcaaacatt 


1680 


tggcaataaa 


gtttcttaag 


attgaatcct 


gttgccggtc 


ttgcgatgat 


tatcatataa 


1740 


tttctgttga 


attacgttaa 


gcatgtaata 


attaacatgt 


aatgcatgac 


gttatttatg 


1800 


agatgggttt 


ttatgattag 


agtcccgcaa 


ttatacattt 


aatacgcgat 


agaaaacaaa 


1860 


atatagcgcg 


caaactagga 


taaattatcg 


cgcgcggtgt 


catctatgtt 


actagatcgg 


1920 


gaagatcctc 


tagagcgatc 


gcaagcttgg 


cgtaatcatg 


gtcatagctg 


tttcctgtgt 


1980 


gaaattgtta 


tccgctcaca 


attccacaca 


acatacgagc 


cggaagcata 


aagtgtaaag 


2040 


cctggggtgc 


ctaatgagtg 


agctaactca 


cattaattgc 


gttgcgctca 


ctgcccgctt 


2100 


tccagtcggg 


aaacctgtcg 


tgccagctgc 


attaatgaat 


cggccaacgc 


gcggggagag 


2160 


gcggtttgcg 


tattgggcgc 


tcttccgctt 


cctcgctcac 


tgactcgctg 


cgctcggtcg 


2220 


ttcggctgcg 


gcgagcggta 


tcagctcact 


caaaggcggt 


aatacggtta 


tccacagaat 


2280 


caggggataa 


cgcaggaaag 


aacatgtgag 


caaaaggcca 


gcaaaaggcc 


aggaaccgta 


2340 


aaaaggccgc 


gttgctggcg 


tttttccata 


ggctccgccc 


ccctgacgag 


catcacaaaa 


2400 


atcgacgctc 


aagtcagagg 


tggcgaaacc 


cgacaggact 


ataaagatac 


caggcgtttc 


2460 
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cccctggaag 


ctccctcgtg 


cgctctcctg 


ttccgaccct 


gccgcttacc 


ggatacctgt 


2520 


ccgcctttct 


cccttcggga 


agcgtggcgc 


tttctcaaag 


ctcacgctgt 


aggtatctca 


2580 


gttcggtgta 


ggtcgttcgc 


tccaagctgg 


gctgtgtgca 


cgaacccccc 


gttcagcccg 


2640 


accgctgcgc 


ettatceggt 


aactategtc 


ttgagtccaa 


cceggtaaga 


cacgacttat 


2700 


cgccactggc 


agcagccact 


ggtaacagga 


ttagcagagc 


gaggtatgta 


ggcggtgcta 


2760 


cagagttctt 


gaagtggtgg 


cctaactacg 


gctacactag 


aagaacagta 


tttggtatct 


2820 


gcgctctgct 


gaagccagtt 


acctteggaa 


aaagagttgg 


tagctcttga 


tccggcaaac 


2880 


aaaccaccgc 


tggtagcggt 


ggtttttttg 


tttgeaagea 


gcagattacg 


cgcagaaaaa 


2940 


aaggatctca 


agaagatcct 


ttgatctttt 


etaeggggtc 


tgacgctcag 


tggaacgaaa 


3000 


actcacgtta 


agggattttg 


gtcatgagat 


tatcaaaaag 


gatcttcacc 


tagatccttt 


3060 


taaattaaaa 


atgaagtttt 


aaatcaatct 


aaagtatata 


tgagtaaact 


tggtctgaca 


3120 


gttaccaatg 


cttaatcagt 


gaggcaccta 


tetcagegat 


ctgtctattt 


cgttcatcca 


3180 


tagttgcctg 


actccccgtc 


gtgtagataa 


ctacgatacg 


ggagggctta 


ccatctggcc 


3240 


ccagtgctgc 


aatgataccg 


cgagacccac 


gctcaccggc 


tccagattta 


tcagcaataa 


3300 


accagccagc 


eggaagggee 


gagegcagaa 


gtggtcctgc 


aactttatcc 


gcctccatcc 


3360 


agtctattaa 


ttgttgccgg 


gaagctagag 


taagtagttc 


gccagttaat 


agtttgcgca 


3420 


acgttgttgc 


cattgetaca 


ggcatcgtgg 


tgtcacgctc 


gtcgtttggt 


atggcttcat 


3480 


tcagctccgg 


ttcccaacga 


teaaggegag 


ttacatgatc 


ccccatgttg 


tgcaaaaaag 


3540 


cggttagctc 


cttcggtcct 


ccgatcgttg 


tcagaagtaa 


gttggccgca 


gtgttatcac 


3600 


tcatggttat 


ggcagcactg 


cataattctc 


ttactgtcat 


gccatccgta 


agatgetttt 


3660 


ctgtgactgg 


tgagtactca 


accaagtcat 


tctgagaata 


gtgtatgcgg 


cgaccgagtt 


3720 


gctcttgccc 


ggegtcaata 


egggataata 


ccgcgccaca 


tagcagaact 


ttaaaagtgc 


3780 


tcatcattgg 


aaaacgttct 


teggggegaa 


aactctcaag 


gatcttaccg 


ctgttgagat 


3840 


ccagttcgat 


gtaacccact 


cgtgcaccca 


actgatcttc 


agcatctttt 


actttcacca 


3900 




ataaacaaaa 

L- C* t#*- Ol tJ. U 


acaaaaaacrc 


aaaatgeege 


aaaaaaggga 


ataagggega 


3960 


caeggaaatg 


ttgaatactc 


atactcttcc 


tttttcaata 


ttattgaagc 


atttatcagg 


4020 


gttattgtct 


catgagegga 


tacatatttg 


aatgtattta 


gaaaaataaa 


caaatagggg 


4080 


ttccgcgcac 


atttccccga 


aaagtgccac 


ctgaegtcta 


agaaaccatt 


attatcatga 


4140 


cattaaccta 


taaaaatagg 


egtatcaega 


ggccctttcg 


tc 




4182 



<210> 8 
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16 



<211> 16 

<212> DNA 

<213> Artificial: primer MDB327 
<220> 

<221> misc^feature 

<222> (1) . . (16) 

<223> "n n = a,c,t or g; "s" = c or g; "w" = a or t 



<400> 8 

ntgaggwtcn wgtsat 

<210> 9 

<211> 22 

<212> DNA 

<213> Artificial: primer MLD015 

<400> 9 

tggttcctag cgtgagccag tg 22 

<210> 10 

<211> 21 

<212> DNA 

<213> Artificial: primer MLD016 

<400> 10 

agctgctgct cttgcctctg t 21 

<210> 11 

<211> 16 

<212> DNA 

<213> Artificial: primer MDB612 
<220> 

<221> misc_f eature 
<222> (1) . . (16) 

<223> "n" = a,c,t or g; "s" = c or g; M w" = a or t 



<400> 11 

ngtgctswga nawgat 16 

<210> 12 
<211> 22 
<212> DNA 

<213> Artificial: primer MDB053 
<400> 12 

catgacgtgg gtttctggca gc 22 

<210> 13 

<211> 21 

<212> DNA 

-c2a3> Artificial: primer MDB356 
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<400> 13 

aatcctgttg ccggtcttgc g 21 

<210> 14 
<211> 22 
<212> DNA 

<213> Artificial: primer DPA017 
<400> 14 

gattagagtc ccgcaattat ac 22 

<210> 15 
<211> 20 
<212> DNA 

<213> Artificial: primer MLD019 
<400> 15 

caagatgcga gcaactatgt 20 

<210> 16 
<211> 200 
<212> DNA 

<213> Artificial: sequence comprising target site deletion 
<400> 16 

tcttggacca caattttgag cttttcattt tctattttga ggataatagt ttattccctc 60 
aaggaactat tcaactgagc ttaatatctc aatttttttt aacatatgac tataagtatc 120 
ctccaaatat ttaaaaagaa tatcaccatt atccgaatct tctttaaaat ctgttagaac 180 
acggtttgga atagtggtag 200 

<210> 17 
<211> 21 
<212> DNA 

<213> Artificial: primer GHI01 
<400> 17 

aacctaggct gctgaaggag c 21 

<210> 18 
<211> 21 
<212> DNA 

<213> Artificial: primer GHI02 
<400> 18 

caactcctcc agtcatctcc g 21 
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